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NOVEL 3' TERMINAL SEQUENCE OF HEPATITIS C VIRUS GENOME 
AND DIAGNOSTIC AND THERAPEUTIC USES THEREOF 

GOVERNMENTAL SUPPORT 

5 

This work was supported by National Cancer Institute/National Institutes of Health grant 
. number CA57973. The government may have certain rights in this invention. - — ~ 

TECHNICAL FIELD OF THE INVENTION 

10 

The present invention relates generally to a novel nucleotide sequence element identified 
at or near the 3' terminus of the hepatitis C virus (HCV) viral genome RNA. This 
element is highly conserved among HCV genotypes and may be essential for HCV 
replication. 

15 

BACKGROUND OF THE INVENTION 

After the development of diagnostic tests for hepatitis A virus and hepatitis B virus, an 
additional agent, which could be experimentally transmitted to chimpanzees (Alter et al, 
20 1978; Hollinger et al, 1978; Tabor et al, 1978), became recognized as the major cause 
of transfusion-acquired hepatitis. cDNA clones corresponding to the causative non-A 
non-B (NANB) hepatitis agent, called hepatitis C virus (HCV), were reported in 1989 
(Choo et al, 1989). This breakthrough has led to rapid advances in diagnostics, and in 
our understanding of the epidemiology, pathogenesis and molecular virology of HCV 
25 (see Houghton et al, 1994 for review). Evidence of HCV infection is found throughout 
the world and the prevalence of anti-HCV antibodies ranges from 0.4-2% in most 
developed countries to more than 14% in Egypt (Hibbs et aJ, 1993). Besides 
transmission via blood or blood products, or less frequently by sexual and congenital 
routes, sporadic cases, not associated with known risk factors, occur and account for 
30 more than 40% of HCV cases (Alter et al, 1990; Mast and Alter, 1993). Infections are 
usually chronic (Alter et al, 1992) and clinical outcomes range from an inapparent 
carrier state to acute hepatitis, chronic active hepatitis, and cirrhosis which is strongly 
associated with the development of hepatocellular carcinoma. Although alpha IFN has 
been shown to be useful for the treatment of some patients with chronic HCV infections 
35 (Davis et al, 1989; DiBisceglie et al, 1989) and subunit vaccines show some promise in . 

the chimpanzee model (Choo et-al,-1994),-future efforts-are-needed-to develop-more 

effective therapies and vaccines. The considerable diversity observed among different 
HCV isolates (for review, see Bukh et at, 1995), the emergence of genetic variants in 
chr nically infected individuals (Enomoto et al, 1993; Hijikata et al, 1991; Kato et al. 
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It has been speculat d that HCV may also replicat in extra hepatic reserv iris}, 
particularly in chr nically infect d individuals. In some cases, RT/PCR or in situ 
hybridization has shown an association of HCV RNA with peripheral blood mononuclear 
cells including T-cells, B-cells, and monocytes (Blight et al, 1992; Bouffard et al, 1992; 
5- Gil et al, 1993; Gunji et a!, 1994; Moldvay et al, 1994; Nuovo et al, 1993; Wahg"et al, 
1992; Young et al, 1993; Yun et al, 1993; Zignego et al, 1992). Such tissue tropism 
could be relevant to the establishment of chronic infections and might also play a role in 
the association between HCV infection and certain immunological abnormalities such as 
mixed cryoglobulinemia (reviewed by Ferri et al, 1993), glomerulonephritis, and rare 
10 non-Hodgkin's B-iymphomas (Ferri et al, 1993; Kagawa et al, 1993). However, the 
detection of circulating negative strand RNA in serum, the difficulty in obtaining truly 
strand-specific RT/PCR (Gunji et al, 1994), and the low numbers of apparently infected 
cells have made ft difficult to obtain unambiguous evidence for replication in these 
tissues in vivo. 

15 

Although a cell culture system capable of efficient HCV replication has not been 
developed, some progress has been made. Consistent with the in vivo observations 
mentioned above, in vitro HCV infection and short term replication have been reported 
for chimpanzee and human hepatocytes (Carloni et al, 1993; lacovacci et aJ, 1993; 

20 Lanford et al, 1994), a human hepatoma line (Huh7; Yoo et al, 1995, see below), 
peripheral blood leukocytes (Muller et al, 1993), a human B-cell line expressing EBV 
antigens (Bertolini et al, 1993), a mouse retrovirus-infected human T-cell line (Molt4-Ma; 
Shimizu et aJ, 1992), an HTLV-1 transformed human T-cell line (MT-2; Kato et al, 
1995), and fibroblasts derived from human foreskin (Zibert et al, 1995). Thus far, only 

25 a small fraction of these ceils appear infected, in vitro infectivrty of different HCV 

inocula using a permissive subclone of the Molt4-Ma T-ceU line correlates well with their 
in vivo infectivrty in the chimpanzee model (Shimizu et ai, 1993). This cell line has aJso 
been used to begin examining HCV binding and the possible emergence of neutralization 
escape mutants during chronic infection (Shimizu et al, 1994b). 

30 

Genome structure. Full-length or nearly fulMength genome sequences of 
numerous HCV isolates have been reported (see Lin et al, 1994; Okamoto et al, 1994; 
Sakamoto et al, 1994 and citations therein). Given the considerable genetic divergence 

among isolatesrit is-clear-that-several-major-HCV-genotypes-are distributed-throughout— 

35 the world (see below). Those of greatest importance in the U.S. are genotype 1, 

subtypes 1a and 1b. HCV genome RNAs are about 9.4 kilobases in length. The 5' NTR 
is 341-344 bases and is the most c nserv d RNA sequence element in the HCV 
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g nome. Th length of the long ORF varies slightly among isolates, encoding 
polyproteins f 3010, 301 1 or 3033 amino acids. The rep rted 3' NTR structures show 
considerable diversity both in composition and length (28-42 bases), and appear to 
terminate with poly (U) (for examples, see Chen et al, 1992; Okamoto et al, 1991; 
5 Tokita et al, 1994) except in one case (HCV-1, type 1a) which appears contain a~3 F 
terminal poly (A) tract (Han et al, 1991). 

Translation and proteolytic processing. Several studies have used cell-free 
translation and transient expression in cell culture to examine the role of the 5' NTR in 

10 translation initiation (Fukushi et al, 1994; Tsukiyama-Kohara et al, 1992; Wang et al, 
1993; Yoo et al, 1992). This highly conserved sequence contains multiple short 
AUG-initiated ORFs and shows significant homology with the 5' NTR region of 
pestiviruses (Bukh et al, 1992; Han et al, 1991). A series of stem-loop structures have 
been proposed on the basis of computer modeling and sensitivity to digestion by 

15 different ribonucleases (Brown et at, 1992; Tsukiyama-Kohara et al, 1992). Although 
still controversial (see Wang et al, 1993; Yoo et al, 1992), the results from several 
groups indicate that this element functions as an internal ribosome entry site (IRES) 
allowing efficient translation initiation at the first AUG of the long ORF (Fukushi et al, 
1994; Tsukiyama-Kohara et at, 1992; Wang et al, 1993). Some of the predicted 

20 features of the HCV and pestivirus IRES elements are similar to one another (Brown et 
al, 1992). ft has been proposed that the 5' terminal hairpin structure and the short 
ORFs may function to downreguiate translation (Yoo et al, 1992). The ability of this 
element to function as an IRES suggests that HCV genome RNAs may lack a 5' cap 
structure. 

25 

The organization and processing of the HCV poiyprotein appears to be most similar to 
that of the pestiviruses. At least 10 polypeptides have been identified and the order of 
these cleavage products in the poiyprotein is NH 2 -C-E1-E2-p7-NS2-NS3-NS4A-NS4B- 
NS5A-NS5B-COOH. Proteolytic processing is mediated by host signal peptidase and 

30 two HCV-encoded proteinases, the NS2-3 autoproteinase and the NS3-4A serine 

proteinase. C is a basic protein believed to be the viral core or capsid protein; E1 and 
E2 are putative virion envelope glycoproteins; p7 is a hydrophobic protein of unknown 
function that is inefficiently cleaved from the E2 glycoprotein (Lin et al, 1994; 
Mizushima et airT994rSelby et alr1994)rand NS2-NS5B-are likely nonstructural (NS) 

35 proteins which function in viral RNA replication complexes. In particular, besides its 
N-termina! serine proteinase domain, NS3 contains motifs characteristic of RNA 
h ticas s and has been shown to possess an RNA-stimulated NTPase activity (Suzich et 
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al, 1993); NS5B c mains th GOD motif charact ristic of the RNA-dependent RNA 
p lymerases of positiv -strand RNA viruses. 



Virion assembly and release. This process has not been examined directly, but 
5 the lack of complex glycans, the ER localization of expressed HCV glycoproteins" 
(Dubuisson et al, 1994; Ralston et al, 1993) and the absence of these proteins on the 
cell surface (Dubuisson et al, 1994; Spaete et al, 1992) suggest that initial virion 
morphogenesis may occur by budding into intracellular vesicles. Thus far, efficient 
particle formation and release has not been observed in transient expression assays, 

10 suggesting that essential viral or host factors are missing or blocked. HCV virion 
formation and release may be inefficient, with a substantial fraction of the virus 
remaining cell-associated, as found for the pestivtruses. A recent study indicates that 
extracellular HCV particles partially purified from human plasma do contain complex 
N-linked glycans, although these carbohydrate moieties were not shown to be 

15 specifically associated with E1 or E2 (Sato et al, 1993). Complex glycans associated 
with glycoproteins on released virions would suggest transit through the trans Golgi and 
movement of virions through the host secretory pathway. If this suggestion is correct, 
intracellular sequestration of HCV glycoproteins and virion formation might then play a 
role In the establishment of chronic infections by minimizing immune surveillance and 

20 preventing lysis of virus-infected cells via antibody and complement. 

Genetic variability. As for all positive-strand RNA viruses, the RNA-dependent 
RNA polymerase of HCV (NS5B) is believed to lack a 3'-5'exonuclease proof reading 
activity for removal of misincorporated bases. Replication is therefore error-prone 

25 leading to a "quasispecies" virus population consisting of a large number of variants 
(Martell et al, 1992; Martell et al, 1994). This variability is apparent at multiple levels. 
First, in a chronically infected individual changes in the virus population occur over time 
(Ogata et al, 1991; Okamoto et al, 1992) and these changes may have important 
consequences for disease. A particularly interesting example is the N-terminal 30 

30 residues of the E2 glycoprotein which exhibits a much higher degree of variability than 
the rest of the polyprotein (for examples, see Higashi et al, 1993; Hijikata et al, 1991: 
Weiner et al, 1991). There is accumulating evidence that this hypervariable region, 
perhaps analogous to the V3 domain of HIV -1 gp120, may be under immune selection 
by circulating antiviral antibodies (Kato et al, 1993; Taniguchi et al, 1993; Weiner et al, 

35 1992). In this model, antibodies directed against this portion of E2 may contribute to 
virus neutralization and thus drive the selection of variants with substitutions which 
escape neutralized n. This plasticity sugg sts that a specific amino acid s quence in 



WO 97/08310 



PCT/US96/14033 



15 



the E2 hypervariable region is not essentia! for th r functions of the protein such as 
virion attachment, penetration, or assembly. Genetic variability may also contribute to 
the spectrum of different responses observed after treatment of chronically infected 
patients with alpha IFN. Diminished serum ALT levels and improved liver histology, 
5. which is sometimes correlated with a decrease in the level of circulating HCV RNAHs 
seen in only -40% of those treated (Greiser-Wilke et al, 1991). After treatment, 
approximately 70% of the responders relapse. In some cases, after a transient loss of 
circulating viral RNA, renewed viremia is observed even during the course of treatment. 
While this might suggest the existence or generation of IFN-resistant HCV genotypes or 
10 variants, further work is needed to determine the relative contributions of virus 

genotype and host-specific differences in immune responsiveness. Finally, sequence 
comparisons of different HCV isolates around the world have uncovered enormous 
genetic diversity (reviewed in Ref. Bukh et al f 1995). Because biologically relevant 
serological assays such as cross-neutralization tests are lacking, HCV types (designated 
by numbers), subtypes (designated by letters), and isolates are currently being grouped 
on the basis of nucleotide or amino add sequence similarity. Amino acid sequence 
similarity between the most divergent genotypes can be as little as -50%, depending 
upon the protein being compared. This diversity is likely to have Important biological 
implications, particularly for diagnostics, vaccine design, and therapy. As mentioned 
earlier, genotypes 1a and 1b are most common in the U.S. (see Bukh et al, 1995 for a 
discussion of 

genotype prevalence and distribution). Recently, in Yoo et al (1995) T7 transcripts from 
various derivatives of an HCV-1 cDNA clone were tested for their ability to replicate by 
transfection of the human hepatoma cell line, Huh7. Possible HCV replication was 
25^assessed by strand-specific RT/PCR (using 5* NTR primers) and metabolic labeling of 

HCV-specific RNAs with 3H-uridine. Transcripts terminating with either poly (A) or poly 
(U), were positive b^these assays but those with a deletion of the 5' terminal 144 
bases were not. In some cutares^CV-specific RNA could be detected in the culture 
media and could be used to reinfect fresh.Huh7 cells. While these claims cannot be 
30 directly refuted, it seems likely that the author^are not actually detecting authentic 

HCV replication. For instance, the authors' positive controlwas productive transfection 
of Huh7 cells with RNA extracted from 1 ml of high HCV titer chirip^rrae plasma. This 
extracted sample would contain a maximum of 107 potentially infectious full-length 
HCV RNA molecules. Under optimum transfection conditions (other than 
microinjection), >105 RNA molecules of virion RNA (at least for poliovirus, Sindbis 
virus, or YR are typically required to initiate a single infectious event. This suggests 
that in the HCV-1 experiment fewer than 100 c lis would be productively transf cted. 
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At 16 days p st-transfection, both positive- and negativ -strand RNAs w r readily 
detected after 8 hours f metabolic labeling. The detect! n of n gative-strand RNA by 
this method (both for transfected virion RNA and transcript RNA) suggests that HCV is 
capable of both efficient replication and spread, and that the level of HCV RNA ^ 
5 synthesis is similar to that which would be expected for a more robust flavivirus. such 
as YF (at the peak of a high multiplicity infection). However, despite numerous 
attempts, the authors were unab.e to detect HCV antigens in these cells using a variety 
of antisera or full-length positive- or negative-strands by Northern analysis (which is 
much more sensitive than metabolic labeling with 3H-uridine» (J. Han, personal 
1 0 communication). To say the least, these results are perpiexing and not easily reconciled 
with authentic HCV replication. Finally, the critical experiment, demonstrating that RNA 
or virus derived from the HCV-1 clone is infectious in the chimpanzee modei. has not 
been reported (despite the initial presentation of this work at a meeting more than two 
years ago). Work in other RNA virus systems has shown that specific terminal 
1 5 sequences can be critical for the generation of functional, replication competent RNAs 
(reviewed in Boyer and Haenni. 1994). Such sequences are believed to be involved In 
initiation of negative- and positive-strand RNA synthesis. In some cases, a few 
additional bases, or even longer non-viral sequences, are tolerated at the 5' and 3' 
termini; these sequences are typically lost or selected against during authentic viral 
20 replication. For other RNA viruses, extra bases, particularly at the 5' terminus, are 
deleterious (Boyer and Haenni. 1 994). In contrast, except in a few cases, transcripts 
lacking authentic terminal sequences are non-functional (Boyer and Haenni. 1994). For 
instance, deletion of the 3' terminal secondary structure or conserved sequence 
elements in the 3' NTR of flavivirus genome RNA is lethal for YF (P. J. Bredenbeek and 
25 C.M.R., unpublished) or TBE (C. Mandl. personal communication) RNA replication. 
Given the Importance of these sequence elements for other viruses, ft is dear that a 
more rigorous determination of the HCV terminal sequences needed to be mede. 

SUMMARY OF THF INVENTION 
30 In view of the aforementioned deficiencies attendant with prior art HCV cDNA clones 
and cell culture systems for the analysis of HCV replication, and for the development of 
therapeutic compositions therefor, it is evident that there exists a need in the art for 

identific ation of particularly the 3' terminal sequence of HCV which can be incorporated 

into a full-length cDNA clone capable of yielding infectious RNA transcripts, which then 
35 can be used as target sequences for the production of attenuated HCV for vaccines, 
and which can be used as targ ts for th rapeutic c repositions. 
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In accordance with the present inv nti n, nucleotide sequenc s derived fr m clon d 
cONA are provided which encode an HCV 3' terminal RNA element. The newly 
discovered 3' element in particular is highly conserved among HCV genotypes, and is a 
general feature of the HCV RNA genome. 
5- — 
The present invention includes a poly (UC) tract followed by a 101 nucleotide 3' 
terminal RNA sequence element, and to full-length HCV viral genome RNA or derived 
HCV RNA replicons containing the following sequence: 
5'- poly (UC) - AAUGGUGGCUCCAUCUUAGCCCUAGUCACGGCUA 
1 0 GCUGUGAAAGGUCCGUGAGCCGCAUGACUGCAG A 

GAGUGCUGAUACUGGCCUCUCUGCUGAUCAUGU -3' (SEQ ID NO:1). 

The present invention also relates to the complement of this RNA sequence and to full- 
length HCV negative-sense RNAs containing the following sequence: 
15 5 # - 

ACAUGAUCAGCAGAGAGGCCAGUAUCAGCACUCUCUGCAGUCAUGCGGCUCACGGAC 
CUUUCACAGCUAGCCGUGACUAGGGCUAAGAUGGAGCCACCAUU-poly (GA) - 3 # (SEQ 
ID NO:2). 

20 The present invention also relates to the DNA sequence corresponding to the 3' NTR 
element in the positive-sense HCV genome RNA: 
5' - poly - (TO - 

AATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCAT 
GACTG CAG AG AGTG CTG ATACTGGCCTCTCTG CTG ATCATGT-3 * (SEQ ID NO:3). 

25 

The present invention also relates to the DNA sequence corresponding to the 
complement of the 3' NTR element present in negative-sense HCV RNA: 
5' - 

ACATGATCAGCAGAGAGGCCAGTATCAGCACTCTCTGCAGTCATGCGGCTCACGGACC 
30 TTTCACAGCTAGCCGTGACTAGGGCTAAGATGGAGCCACCATT-poly (GA)-3' (SEQ ID 
NO:4). 

It should be appreciated, that although this sequence appears to "noncoding," it is 
possible that the sequence encodes a polypeptide of importance foTHCV ~ 
35 replication. There are two short open reading frames in the complement of the 3' 

terminal el ment which w uld be at the 5' f th negative-sens RNA followed by poly 
(A). Thes sequences could be expressed via translation of the 
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negativ strand RNA ( ither a full-l ngth negative-strand or a subgenomic 
RNA). 

In a further embodiment, this element, which extends beyond the previously accepted 
5 homopolymer tracts of polylU) or poly(A), can be used to assemble full-length HC\T 
cDNA clones for HCV-H and other HCV isolates (genotypes, types and subtypes) (HCV- 
1 acc.* M62321; HC-J1 acc.# D10749; HC-J acc.# D90208; HCV-BK acc.# M58335; 
HCV-H acc.# M67463; HC-J6 acc.# D00944; HC-J8 acc.# 001221; HC-J483 ace* 
D13558; HC-J491 acc.# D10750; HC-C2 acc.# D10934; HCV-JK acc.# X61596; HCV- 
1 0 N acc.# S62220; HCV-T acc.# M84745; HCV-JT acc.# D01 1 71 ; HCV-JT acc.# 

D01172; HC-G9 acc.# D14853; HCV-K3a acc.# D28917; NZL1 acc.# 017763; HCV-Tr 
acc.# D2655G). Such fulNength viruses or derived HCV RNA replicons are useful for 
the study of HCV replication and virus-host interactions as well as for the development 
of screening assays for HCV therapeutics and the evaluation of therapeutic compounds. 

15 

In a still further embodiment, live-attenuated strains of HCV are provided for use as 
vaccines. 

The present invention also relates to a recombinant DNA or RNA molecule or a 
20 degenerate variant thereof, which encodes the 3' HCV terminal sequence element; 

preferably the nucleic acid contains a sequence with substantially the same nucleotide 
sequence as SEQ ID N OS: 1-4 or that shown in FIGURE 3 (SEQ ID N0S:20-24; the parts 
of sequences downstream of the poly (U) tract only), FIGURE 6 (SEQ ID NOS:28-31) 
and FIGURE 8 (SEQ 10 NOS:33-36) and sequences forming a secondary structure similar 
25 to that of Figure 4 (SEQ ID N0:25), or alternatively, a structure which may be formed 
via interaction of the 3 # terminal sequence element (or its compliment) with other HCV 
RNA sequences (e.g., 5' terminal sequences or other sequences at or near the 3' NTR). 

The sequences of the HCV of the present invention or portions thereof, may be 
30 prepared as probes (or primers for RT-PCR) to screen for complementary sequences and 
related clones in the same or alternate species. The present invention extends to 
probes or primers so prepared that may be provided for screening cDNA libraries, 
plasma or infected cells for HCV. For example, the probes may be prepared 

synthetically and by recombinant DNA with a variety of known vectors, such~as~the 

35 phage, plasmid or viral vector. The present invention also includes the preparation of 
plasmids including such vectors, and the use of the DNA/RNA s qu nces t construct 
v ct rs expressing antisense RNA r ribozymes which would attack natural or 
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engin er d HCV RNAs containing any r all of th sequences set forth in FIGURES 3 
(SEQ ID NOS:20-24), FIGURE 6 (SEQ ID N0S:28-31) and FIGURE 8 <SEQ ID NOS.33- 
36) or as above as SEQ ID NOS:1-4, derivatives of the sequences, or homologous 
sequences from other HCV types/subtypes. Correspondingly, the preparation of 
5. antisense RNA and ribozymes are included herein. ~ - — ~ 

The present invention also includes RNA molecules having the properties noted herein, 
and that display the sequences set forth and described above and selected from SEQ ID 
N0S:1-4, 20-24, 28-31 and 33-36. 

10 

In a further embodiment of the invention, the full nucleotide sequence of the HCV 
containing the above determined sequences may be introduced into an appropriate host. 
The invention accordingly extends to host cells transfected or transformed with the 
cloned HCV sequences and/or RNA derived therefrom, and more particularly, replication 
15 competent and/or complete DNA/RNA sequences assembled using the sequences, or 
homologous derivatives set forth above* 

According to other preferred features of certain preferred embodiments of the present 
invention, a transiently transfected or stable cell line is provided to produce infectious 
20 HCV, and attenuated strains of HCV. 

The present invention naturally contemplates several means for preparation of the HCV 
sequences, including as illustrated herein known recombinant techniques, and the 
invention is accordingly intended to cover such synthetic preparations within its scope. 

25 The isolation of the sequences disclosed herein facilitates the reproduction of not only 
the nucleic acid sequences themselves, but also infectious HCV, and attenuated HCV by 
such recombinant techniques, and accordingly, the invention extends to the wild type 
and attenuated HCVs so prepared from the disclosed sequences, and to transiently 
transfected cells or stable cell lines expressing this sequence, replicating HCV RNA, 

30 and/or producing virus. 

The invention includes an assay system for screening of potential drugs effective to 
modulate replication of HCV in target cells by interrupting or potentiating the viral life 
cycle. Potentiation would be desirable where stocJcs~of"HCV~were to be producedHor — 
35 use in experimental as well as therapeutic regimes (i.e., vaccines). In one instance, the 
test drug could be administered to a cellular sample transfected with an infectious HCV, 
cDNA done or replicati n-compet nt RNA, to determine its effect upon the replicative 
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activity of the HCV in the presence of any chemical sample (including DNA or RIM A), or 
to th test drug, by comparis n with a contr I. 

The assay system could more importantly be adapted to identify drugs or other entities 
5 that are capable of binding to the HCV RNA sequences or which bind essentiaf factors 
interacting with these sequences, thereby inhibiting or potentiating replication. Such 
assays would be useful in the development of drugs that would be specific against a 
wide range of HCV isolates, due to the conservation of an important 3' terminal 
sequence motif, identified by SEQ ID NOS:1-4. 

10 

In yet a further embodiment, the invention contemplates antagonists of the activity of 
HCV, in particular, an agent or molecule that inhibits viral replication or transcriptional 
activity in general. In a specific embodiment, the antagonist can be an oligonucleotide 
having the sequence (or its complement) of a portion of a 3' terminal domain of an 
1 5 HCV. Such oligonucleotides may be capable of disrupting strand synthesis required for 
viral replication, translation of HCV RNA into protein, or packaging of genome RNA Into 
virus particles. 

The diagnostic utility of the present invention extends to the use of the present 3' 
20 terminal sequence in assays to screen for HCV infection. In particular, probes or PCR 
primers may be produced which are capable of detecting HCV infection In blood, or In 
infected cells. Such probes may be labelled with any detectable label. In the instance 
where a radioactive label, such as the isotopes *H, 14 C, ,a P, 15 S, "CI, 61 Cr, "Co, "Co, 
"Fe, *°Y, 12S I, U1 l, and ,aa Re are used, known currently available counting procedures 
25 may be utilized. In the instance where the label is an enzyme, detection may be 
accomplished by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques known in the art. 

The present invention includes an assay system which may be prepared in the form of a 
30 test kit for the quantitative analysis of the extent of the presence of the HCV 

sequences, or to identify drugs or other agents that may mimic or block the activity of 

such sequences. The system or test kit may comprise a labeled component prepared by 
one of the radioac tive and/or enzymatic techniques discussed herein, coupling a label to 

a probe for HCV nucleic acid, or a binding partner thereof, or a binding partne7~ortfie 

35 HCV virion itself, and one or more additional immunochemical reagents, at least one of 

which is a fre r immobiliz d iigand, capable either f binding with the labeled 
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compon nt, its binding partn r, on of the c mp n nts t be det rmined r their binding 
partner(s). 

In a further embodiment, the present invention relates to certain therapeutic methods 
5 which would be based upon the activity of the HCV sequences(s), or active fragfneTits 
thereof, or upon agents or other drugs determined to possess the same activity. A first 
therapeutic method is associated with the prevention of infection by HCV, in particular 
by providing a vaccine composed of an attenuated HCV, designed by mutating the 
sequence elements disclosed herein. 

10 

More specifically, the therapeutic method generally referred to herein could include the 
method for the treatment of hepatitis or other cellular dysfunctions caused by HCV by 
the administration of pharmaceutical compositions that may comprise effective 
inhibitors of the HCV or its subunhs, or other equally effective drugs developed for 
1 5 instance by a drug screening assay prepared and used in accordance with a further 
aspect of the present invention. For example, drugs or other binding partners to the 
HCV nucleic add or its encoded proteins, may be administered to inhibit or potentiate 
transcriptional activity. 

20 In particular, HSV or its herein-identified 3' sequence element or fragments thereof, and 
binding partners thereto could be prepared in pharmaceutical formulations for 
administration in instances wherein interferon therapy is appropriate, such as to treat 
chronic viral hepatitis or other HCV-associated illnesses. 

25 Accordingly, it is a principal object of the present invention to provide a novel 3' 

sequence element of HCV, as well as full-length HCV genomes which encode wild-type 
or attenuated HCV bearing this additional sequence. 

It is a further object of the present invention to provide a method for detecting the 
30 presence of the HCV in mammals in which HCV is suspected to be present. 

It is a further object of the present invention to provide a method and associated assay 
system for screening substances such as drugs, agents and the like, potentially 
effective in combating the adverse effects of the HCVTn mammals: 



35 
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It is a still further object f the pres nt invention to provide a method for th tr atment 
of mammals t contr I the amount or activity of th HCV or fragments thereof, s as to 
alter the adverse consequences of such presence or activity. 

5. It is a still further object of the present invention to provide a method for the treatment 
of mammals to control the amount or activity of the HCV or its subunits, so as to treat 
or avert the adverse consequences of a pathological state. 

It is a still further object of the present invention to provide pharmaceutical 
10 compositions for use in therapeutic methods which comprise or are based upon the 
HCV, its sequence elements, their binding partner(s), or upon agents or drugs that 
control the production, or that mimic or antagonize the activities of the HCV. 

Other objects and advantages will become apparent to those skilled in the art from a 
1 5 review of the ensuing description which proceeds with reference to the following 
illustrative drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 FIGURE 1 depicts a method for determining the 3' terminal sequence of HCV. A 
phosphoryiated synthetic oligodeoxynudeotide Coligo A - ; 5'- 

GACTGTTGTGGCCTGCAGGGCCGAATT-3'; SEQ ID NO:5) was Ugated to the 3' end of 
the RNA to serve as a specific priming site for cDNA synthesis. A primer for cDNA 
synthesis Coligo B"; m 5-TTGAATTCGACCCTGCAGGCCACAACA-3'; SEQ ID NO:6 or 
25 B(2); 5*-TTGAATTCGGCCCTGCAGGCCACAACAGTC-3'; SEQ ID NO:7) was 

complementary to that used for ligation to the RNA; a second positive-sense primer for 
PCR corresponded to a sequence near the 3' end of the HCV ORF ("oligo C"; 5'- 
CAAGTCGACGGGGAGACATTTATCACAGC-3'; SEQ ID NO:8). 

30 FIGURE 2 depicts an alignment comparing the portion of the 3' terminal sequence (DNA 

corresponding to the positive sense HCV genome RNA) of HCV-H-AAK (SEQ ID NO:11; 

determined by sequencing of an undoned DNA fragment which was synthesized by PCR 
usin g an oltgo (dA) primer and "oligo CM between the termination codon of the ORF and 

the poly (U) tract with a partial list of published sequences forotherHCV-isolates — 

35 {genotypes): HCV-Hda) isolate (SEQ ID N0:9); HCV-1<1a) (SEQ ID NO:10); HCV-J1(1a) 

{SEQ ID NO:12); HCV-BK(1b) (SEQ ID NO:13); HCV-TW(1b) (SEQ ID N0:14); HCV- 
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N(1b) (SEQ ID NO:15); HCV-J6(2a) (SEQ ID NO:16); HCV-J8 (SEQ ID NO:17); HCV- 
NZL1{3a) (SEQ ID NO: 18); and HCV-Tr(3b) (SEQ ID NO:19). 

FIGURE 3 depicts the sequence (DNA corresponding to the positive sense HCV genome 
5. RNA) of HCV-H 3' clones including H77-*1,2 (SEQ ID NO:20), H77-#8 (SEQ ID NOf21). 
H77-#10 (SEQ ID NO:22), H77-#74 (SEQ ID NO:23), H77-#5 (SEQ ID NO:24). 

FIGURE 4 depicts the computer predicted (FOLD RNA, GCG package) secondary 
structure of the 3' end of HCV-H (SEQ ID NO:25). The last 46 nucleotides form a 
10 stable stem-loop structure (predicted energy -25 kCal/mol). 

FIGURE 5 depicts the scheme used for the RT/PCR amplification and cloning of partial 3' 
terminal segments from four different HCV subtypes (1b f 3, 4 and 4a) (SEQ ID NOS:28- 
31). Briefly, oligo D (5'-TAACATG ATCAGCAG AG AGGCCAG-3') (SEQ ID NO:26) was 
1 5 annealed to the 3' end of the genomic RNA, and cDNA was synthesized. Next, PCR 
was performed using oligo C (SEQ ID NO:8) and otigo E (5'- 

CTCACGGACCTTTCACAGC-3') (SEQ ID NO: 27). The PCR products were cloned and 
sequenced. 

20 FIGURE 6 shows an alignment of sequences (5'-3') (DNA sequence corresponding to the 
positive-sense HCV genome RNA) determined for 3 segments from HCV subtypes 1b, 3, 
4 and 4a amplified and cJoned as described in Figure 5. The termination codon (TGA) is 
shown in bold. 

25 FIGURE 7 depicts a scheme for 3' end oligonucleotide ligation, RT/PCR and cloning of 
the HCV RNA from four different HCV genotypes/subtypes (lb, 3, 4 and 4a). Methods 
were essentially as outline in Figure 1, except that oligo F (5'- 

CCAAGAATTCCCTAGTCACGGCTAGC-3') (SEQ ID NO;32) was used instead of oligo C. 

30 FIGURE 8 shows the sequences (DNA sequence corresponding to the positive-sense 

HCV genome RNA) (SEQ ID N0S:33-36) resulting from the analysis described in FIGURE 
7. Isolate-specific sequence differences are shown in bold. 



FIGURE 9 depicts a scheme for aftemative~3'^nd~primer iigationand RT/PGR-for-a 

35 genotype 4 isolate of HCV. A different oligonucleotide ("oligo G*; 5'- 

CGCACCCTGTCCGACTACAACATCC-3'; SEQ ID NO:37) was used for the RNA ligation 
step. Primers used for cDNA synthesis <" ligo H M ; 
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CAGAATTCTTGTAGTCGGACAGGGTGCG-3*; SEQ ID NO:38) and for PCR ("olig F" and 
"oligo H") are indicat d. 

FIGURE 10 depicts the predicted structure of a full-length HCV genome RNA (precise 
5 numbers refer to HCV-H). The genome RNA probably initiates with a G residue' 6nQ 
contains a 5' non-translated region (NTR) of 341 bases. The ORF consists of 9033 
bases encoding a polyprotein of 301 1 amino acid residues. Following the opal (UGA) 
stop codon is a sequence of 40 bases, a poly (U) tract, a polypyrimidine stretch, and a 
novel conserved RNA element of 101 bases. 

10 

DETAILED DESCRIPTION 

In accordance with the present invention there may be employed conventional molecular 
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 

15 techniques are explained fully in the literature. See, e.g., Maniatis, Frftsch & Sam brook, 
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols In Molecular 
Biology" Volumes Mil [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory 
Handbook" Volumes Mil [J. E. Celis, ed, (1994))]; "Current Protocols in Immunology" 
Volumes Mil [Coligan, J. E., ed. (1994)]; 'Oligonucleotide Synthesis" (M.J. Gait ed. 

20 1984); "Nucleic Acid Hybridization" (B.D. Hames & S.J. Higglns eds. (1985)]; 

"Transcription And Translation" [B.D. Hames & S.J. Higgins, eds. (1984)1; "Animal Cell 
Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, 
(1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). 

25 Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

The terms "3' terminal sequence element," "3' terminus," "3' sequence element," and 
any variants not specifically listed, may be used herein interchangeably, and as used 

30 throughout the present application and claims refer to nucleotide sequences having the 
sequence data described herein and presented in SEQ ID N0S:1-4 or that shown 
FIGURE 3 (SEQ ID NOS:20-24; the parts of sequences downstream of the poly (U) tract 

only)rFIGURE-6-(SEQLlD_NOS:28-31) and FIGURE 8 (SEQ ID N0S:33-36) and the profile 

of properties set forth herein and in the Claims. It should be appreciated that thTterms — 

35 "3' terminal sequence element," "3' terminus," "3' sequence element," are meant to 
enc mpass all of the following sequences: (i) an RNA sequence at the 3' terminus f 
the positiv -sense genome RNA; (ii) the complement of this RNA sequence at the 5' 
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terminus of the HCV negative-sense RNA; (iii) the ONA sequence c rr sponding t the 
positive-sense sequence f the RNA lement; and (iv) the DNA sequence c rr sponding 
to the negative-sense sequence of the RNA element. Examples of such sequences are 
illustrated in SEQ ID N0S:1-4, respectively. Accordingly, nucleotide sequences 
5 . displaying substantially equivalent or altered properties are likewise contemplated. — ~ 
These modifications may be deliberate, for example, such as modifications obtained 
through site-directed mutagenesis, or may be accidental, such as those obtained 
through mutations in hosts that are producers of the complex or its named subunrts. 
Also, the terms "3' terminal sequence," "3' terminus/ and "3' sequence element," are 
10 intended to include within their scope nucleic acid molecules specifically recited herein 
as well as all substantially homologous analogs and allelic variations. 

Any amino add residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid 

15 residue, as long as the desired functional property of immunoglobulirvbinding is retained 
by the polypeptide. NH 2 refers to the free amino group present at the amino terminus of 
a polypeptide. COOH refers to the free carboxy group present at the carboxy terminus 
of a polypeptide. In keeping with standard polypeptide nomenclature, J. BioL Chem., 
243:3552-59 (1969), abbreviations for amino acid residues are shown in the following 

20 Table of Correspondence: 



TABLE OF CORRESPONDENCE 



SYMBOL 


AMINO ACID 




1 -Letter 


3-Letter 




25 Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


30 S 


Ser 


serine 


1 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


VaT~ ~ 


valine — 


35 P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 
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Q Gin glutamine 

E Gtu glutamic acid 

W Trp tryptophan 

R Arg arginine 

5 0 Asp aspartic acid 

N Asn asparagine 

C Cys cysteine 



It should be noted that all amino-actd residue sequences are represented herein by 
10 formulae whose left and right orientation ts in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a peptide bond to a 
further sequence of one or more amino-acid residues. The above Table is presented to 
correlate the three-letter and one-letter notations which may appear alternately herein. 

15 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as 
an autonomous unit of DNA or RNA replication in vivo; i.e., capable of replication under 
its own control. Bradenbeck and Rice (1992) Semin. ViroL 3:297-310 contains a 
description of RNA re pi icons. 

20 

A "vector" is a replicon, such as a plasmid, phage or cosmid, to which another DNA (or 
RNA) segment may be attached so as to bring about the replication of the attached 
segment. 

25 A "DNA molecule" refers to the polymeric form of deoxyribonudeotides (adenine, 

guanine, thymine, or cytosine) in its either single stranded form, or a double-stranded 
helix. This term refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes double- 
stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), 

30 viruses, plasmids, and chromosomes. In discussing the structure of particular double- 
stranded DNA molecules, sequences may be described herein according to the normal 
convention of giving only the sequence in the 5' to 3' direction along the no n transcribed 

strand of DNA (i.e., the strand having a sequence homologous to the mRNA). 



35 An "RNA molecule" refers to the polymeric form of ribonucleotides (adenine, guanine, 
uridine, or cytosine) in its either single stranded for, or a double-stranded helix. This 
term refers only to th primary and secondary structure o the mol cule, and does not 
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limit it to any particular tertiary forms. Thus, this term includes single-stranded and 
double-stranded RNA found, inter alia, in linear or circular RNA molecules. In discussing 
the structure of particular RNA molecules, sequence may be described herein according 
to the normal convention of giving the sequence in the 5' to 3' direction. 
5- — 
An "origin of replication" refers to those DNA sequences that participate in DNA 
synthesis. 

A "coding sequence" or "open reading frame" is a nucleotide sequence which is 
10 transcribed and translated into a polypeptide in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cONA from eukaryotic mRNA, genomic DNA sequences from 
15 eukaryotic (e.g., mammalian) DNA, and even synthetic DNA or RNA sequences. A 

polyadenyiation signal and transcription termination sequence will usually be located 3' 
to the coding sequence. 

Transcriptional and translational control sequences are DNA regulatory sequences, such 
20 as promoters, enhancers, polyadenyiation signals, terminators, and the like, that provide 
for the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase 
in a cell and initiating transcription of a downstream (3' direction) coding sequence. For 

25 purposes of defining the present invention, the promoter sequence is bounded at its 3' 
terminus by the transcription initiation site and extends upstream |5' direction) to 
include the minimum number of bases or elements necessary to initiate transcription at 
levels detectable above background. Within the promoter sequence will be found a 
transcription initiation site (conveniently defined by mapping with nuclease S1), as well 

30 as protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes 
and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in addition 
— ___to the -10 and -35 consensus sequences. Promoter sequences can also be used to refer 
to analogous RNA sequences or structures ofsimilarfunction in-RNA_viruArepJica^on_^ 

35 and transcription. 
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An "expression c ntrol sequence" is a DNA sequence that controls and regulates the 
transcription and translation f another DNA sequence. A coding sequence is "under 
the control" of transcriptional and translational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into mRNA, which is then translated into 
5. the protein encoded by the coding sequence. RNA sequences can also serve as — ~ 
expression control sequences by virtue of their ability to modulate translation, RNA 
stability, and replication (for RNA viruses). 

A "signal sequence" can be included before the coding sequence. This sequence 
10 encodes a signal peptide, N-terminal to the polypeptide, that communicates to the host 
cell to direct the polypeptide to the cell surface or secrete the polypeptide into the 
media, and this signal peptide is clipped off by the host cell before the protein leaves 
the cell. Signal sequences can be found associated with a variety of proteins native to 
prokaryotes and eukaryotes. 

15 

The term "oligonucleotide," as used herein in referring to the probe of the present 
invention, is defined as a molecule comprised of two or more ribonucleotides, preferably 
more than three. Its exact size will depend upon many factors which, in turn, depend 
upon the ultimate function and use of the oligonucleotide. 

20 

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
naturally as in a purified restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of synthesis when placed under conditions in which 
synthesis of a primer extension product, which is complementary to a nucleic acid 

25 strand, is induced, i.e., in the presence of nucleotides and an inducing agent such as a 
DNA polymerase and at a suitable temperature and pH. The primer may be either 
single-stranded or double-stranded and must be sufficiently long to prime the synthesis 
of the desired extension product in the presence of the inducing agent. The exact 
length of the primer will depend upon many factors, including temperature, source of 

30 primer and use of the method. For example, for diagnostic applications, depending on 
the complexity of the target sequence, the oligonucleotide primer typically contains 1 5- 
25 or more nucleotides, although it may contain fewer nucleotides. 



The primers herein are selected to be "substantially" complementary-to-drfferent-strands. 
35 of a particular target DNA sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, the primer 
s quence need not reflect th exact sequence of the template. For exampl , a non- 
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extension product. 

As .« to*. « — ■— — — «— * - •'• 5 ° iCto ' enn ""f * 
.nz,™, -oh »< «** «■ ° W «" " ^ 

10 nucleotide sequence. 

A eel, has been "transformed" by exogenous or heterotogous DNA when such DNA ^has 
been introduced Inside the cell. The transforming DNA may or may 1)6 nt ^ rated 
(covalentty linked) into chromosomal DNA making up the genome of 

^ZT anepisoma,«ementsuchas.p.asm^ V^h respect to euka.ot^, 
nsf ormed cel. is one in which *. transforming DNA has become integrated 

r.L— — " * ^ - throu9 : tz: 

repiication. This stability is demonstrated by the abiiity of the eukaryofc cel. o 
replication. i™ MOU iation of daughter cells containing the 

20 establish cell lines or clones composed of a population o a . ... or 
„ «„ONA a -clone" is a population of cells derived from a angle cell or 
transformmgONA. A clone . pop ^ ^ „ ^ of 

common ancestor by mitosis. A cell Una is a cio 

.tabie growth ft W«n, for many generations. This definition can be applied to RN A 

wHchcanbeusedto transformer -transfers. For 80m ® viruses, 

:lls,s U p P o rt virusrep,^onand,insome^ 
particles. 

Two DNA or RNA sequences are -substantially homologous" when at least about 75% 
30 ZeZl .east about 80%. and most preferably at least about SO or 9 S%> of the 
" dmat.overthed.ned.ngthoftheD.Ase.uences. Su- 
bstantially homologous can be identified by comparing the sequences using stand rd 
suostam v southern hybridization experiment 

software available in sequence data banxs, or m o 

T«L DNA Co™.,. Vote, ltd. Mud* Add H yb r,d,»»<,n. «... M.„ 
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are r f erred to as "homologous." In the extreme case, such s quences could be 
unrelated in terms of linear sequ nee identity, but may have substantially similar 
secondary structure. 

5 A "heterologous" region of a DNA or RNA construct is an identifiable segment of -DNA 
or RNA molecule within a larger nucleic acid that is not found in association with the 
larger molecule in nature. For instance, when the heterologous region encodes a 
mammalian gene, the gene will usually be flanked by DNA that does not flank the 
mammalian genomic DNA in the genome of the source organism. Another example of a 
1 0 heterologous coding sequence is a construct where the coding sequence itself is not 
found in nature (e.g. # a cDNA where the genomic coding sequence contains introns, or 
synthetic sequences having cod on s different than the native gene). Allelic variations or 
naturally-occurring mutational events do not give rise to a heterologous region of DNA 
as defined herein. 

15 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, that 
binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in further detail in U.S. Patent Nos. 4,816,397 
and 4,816,567. 

20 

An "antibody combining she" is that structural portion of an antibody molecule 
comprised of heavy and light chain variable and hypervariable regions that specifically 
binds antigen. 

25 The phrase "antibody molecule" in its various grammatical forms as used herein 

contemplates both an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules are intact immunoglobulin molecules, substantially intact 
30 immunoglobulin molecules and those portions of an immunoglobulin molecule that 
contains the paratope, including those portions known in the art as Fab, Fab', F(ab') 2 
and F(v), which portions are preferred for use in the therapeutic methods described 
herein. 



35 
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Th ofilopolous t al Fab' antibody molecut p rtions are also well-known and are 
produced fr m F(ab') 2 portions followed by r ducti n f the disulfide bonds linking the 
two heavy chain portions as with mercaptoethanol, and followed by alkylation of the 
resulting protein mercaptan with a reagent such as iodoacetamide. An antibody 
5 containing intact antibody molecules is preferred herein. - — ~ 

The phrase, "monoclonal antibody** in its various grammatical forms refers to an 
antibody having only one species of antibody combining site capable of immunoreacting 
with a particular antigen. A monoclonal antibody thus typically displays a single binding 
10 affinity for any antigen with which it immunoreacts. A monoclonal antibody may 

therefore contain an antibody molecule having a plurality of antibody combining sites, 
each immunospecific for a different antigen; e.g., a bispectftc (chimeric) monoclonal 
antibody. 

1 5 The phrase "pharmaceuticaily acceptable** refers to molecular entities and compositions 
that are physiologically tolerable and do not typically produce an allergic or similar 
untoward reaction, such as gastric upset, dizziness and the like, when administered to a 
human. 

20 The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient %o prevent, and preferably reduce by at least about 30 percent, more 
preferably by at least 50 percent, most preferably by at least 90 percent, a clinically 
significant change in the S phase activity of a target cellular mass, or other feature of 
pathology such as for example, elevated blood pressure, fever or white cell count as 

25 may attend its presence and activity. 

A DNA sequence is "operatively linked*" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate start 

30 signal (e.g., ATG or AUG) in front of the DNA sequence to be expressed and 

maintaining the correct reading frame to permit expression of the DNA sequence under 
the control of the expression control sequence and production of the desired product 
. encoded by the DNA sequence. If a gene that one desires to insert into a racombinant 
DNA molecule does not contain an appropriate~start:signalrSU(^-a-start-signal_cajiie__ 

35 inserted in front of the gene. 
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The term "standard hybridization conditions" in general refers to salt and temperature 
c ndrti ns substantially quivalent to 5 x SSC and 65 °C for b th hybridization and 
wash. However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of sodium 
5 and magnesium in the buffer, nucleotide sequence length and concentration, percent 
mismatch, percent formamide, and the like. Also important in the determination of 
"standard hybridization conditions" is whether the two sequences hybridizing are RNA- 
RNA, DNA-DNA or RNA-DNA. Such standard hybridization conditions are easily 
determined by one skilled in the art according to well known formulae, wherein 
10 hybridization is typically 10-20T below the predicted or determined T m with washes of 
higher stringency, if desired. 

By "HCV" is meant a diverse group of related viruses classified as a separate genus in 
the flavivirus family. The characteristics of this genus are described in the Background 
15 of the Invention above, and include such members as HCV-1 , HC-J1, HC-J, HCV-BK, 
HCV-H, HC-J6, HC-J8, HC-J483, HC-J491, HC-C2, HCV-JK, HCV-N, HCV-T, HCV-JT, 
HC-G9, HCV-K3a, NZL1 , HCV-Tr and the like. 

In its primary aspect, the present invention concerns the identification of novel terminal 
20 sequences present in HCV positive-sense genomic RNA. 

In a particular embodiment, the present invention relates to novel 5' and 3' terminal 
sequences which are highly conserved by ail members of the herein disclosed HCVs. 

25 As stated above, the present invention also relates to a recombinant DNA or RNA 

molecule or cloned gene, or a degenerate variant thereof, which encodes an HCV, or a 
fragment thereof, which has a nucleotide sequence or is complementary to a nucleotide 
sequence shown in SEQ ID N0S:1-4 or that shown FIGURE 3 (SEQ ID NOS:20-24; the 
parts of sequences downstream of the poly (U) tract only), FIGURE 6 (SEQ ID NOS:28- 

30 31) and FIGURE 8 (SEQ ID NOS:33-36). 

The invention also relates to infectious HCV cDNA clones comprising previously 
disclosed 5' non-coding, coding and 3' non-coding sequences including those encoding 
poly (U) or poly (A) tracts ((HCV^racc;* M6 2321 ; HG- J 1-acc #-D 1 07,49; HC^Jaca#__ 
35 D90208; HCV-BK acc.# M58335; HCV-H acc.# M67463; HC-J6 acc.# D00944; HC-J8 
acc.# D01221; HC-J483 acc.# D13558; HC-J491 acc.# D10750; HC-C2 acc.# 
D10934; HCV-JK acc.# X61596; HCV-N acc.# S62220; HCV-T acc.tf M84745; HCV- 
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JT acc.# D01171; HCV-JT acc.# D01172; HC-G9 acc.# D14853; HCV-K3a acc.# 
D28917; NZL1 acc.tf D17763; HCV-Tr acc.# D26556 and thers), the p lypyrimidine 
tract and the novel 3' element of 101 bases (SEQ ID NO:1c, d or related sequences). 

5. The possibilities both diagnostic and therapeutic that are raised by the existence- o4-the 
full length HCV clone, derive from the fact that the terminal sequences of viral genomes 
can be critical for the generation of functional, replication competent RNAs (Boyer et al, 
J. Gen. ViroL 198:415-426). As suggested earlier and elaborated further on herein, the 
present invention contemplates pharmaceutical intervention in the infectious life cycle of 



Thus, in instances where it is desired to inhibit infecttvity of HCV, an appropriate 
inhibitor of the 3' sequence element could be introduced to block the interaction of the 
initiation of negative- and positive-strand synthesis required for viral replication. 
1 5 Correspondingly, infectivity may be remedied by the introduction of additional quantities 
of a nucleic acid molecule encoding the 3' sequence element or its chemical or 
pharmaceutical cognates, analogs, fragments and the like. 

As discussed earlier, the molecules or agents exhibiting either mimicry or antagonism to 
20 the 3' sequence element, or control over the replication of the HCV, may be prepared in 
pharmaceutical compositions, with a suitable carrier and at a strength effective for 
administration by various means to a patient experiencing an adverse medical condition 
associated with HCV for the treatment thereof. A variety of administrative techniques 
may be utilized, among them parenteral techniques such as subcutaneous, intravenous 
25 and intraperitoneal injections, catheterizations and the like. Average quantities of the 
molecules or their subunits may vary and in particular should be based upon the 
recommendations and prescription of a qualified physician or veterinarian. 

As suggested earlier, the diagnostic method of the present invention comprises 
30 examining a cellular sample or medium by means of an assay including an effective 

amount of a binding partner to the HCV RNA. As previously discussed, patients capable 
of benefiting from this method include those suffering from infection by HCV. Methods 
for isolating the molecules which bind HCV sequences to assist in the examination of 
the target cells are all well-known in the art^ — 



The present invention further contemplates therapeutic compositions useful in practicing 
the therapeutic methods of this invention. A subject therapeutic compositi n includes, 



10 



HCV. 



35 



WO 97/08310 



PCT/US96/14033 



26 

in admixtur , a pharmaceutically acceptable excipient (carrier) and one or more f a 3' 
terminal sequ nee lem nt, analog thereof or fragment thereof, or a molecule which 
inhibits the properties or activities of that sequence element, as described herein as an 
active ingredient. In a preferred embodiment, the composition comprises a molecule 
5 capable of modulating the secondary structure formation of the 3' terminus of-ttrcrHCV 
RNA, and/or the initiation of negative- and positive-strand RNA synthesis and/or RNA 
packaging. 

A further therapeutic composition includes a full-length attenuated HCV which can be 
10 used as a vaccine against HCV infection, wherein a sequence within the genome of 
HCV, in particular the 3' terminal sequence element (SEQ ID N0S:1-4) has been 
modified, either as a naturally-occurring isolate, or via in vitro evaluation or site-directed 
mutagenesis. Particular mutations suitable for such attenuated viruses include those 
which alter the structure of the 3' terminus of the HCV genome RNA (or the 5' terminus 
15 of negative sense HCV RNA), and, by so doing, alter the initiation and/or translation of 
negative- and positive-strand RNA synthesis and/or RNA packaging. Such modifications 
may be aided by computer modelling and evaluated using infectivrty assays. 

The preparation of therapeutic compositions which contain the 3' terminal sequence 
20 element or antagonist thereof, or the full-length attenuated HCV, analogs or active 
fragments as active ingredients is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or suspensions, 
however, solid forms suitable' for solution in, or suspension in, liquid prior to injection 
can also be prepared. The preparation can also be emulsified. The active therapeutic 
25 ingredient is often mixed with excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitable excipients are, for example, water, 
saline, dextrose, glycerol, ethanol, or the like and combinations thereof, in addition, if 
desired, the composition can contain minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents which enhance the effectiveness of 
30 the active ingredient. 

A polypeptide, analog or active fragment can be formulated into the therapeutic 
composition as neutralized pharmaceutically acceptable salt forms. Pharmaceutically 

acceptable salts include the acid addition salts (formed witxTtrieTfree amino groups of 

35 the polypeptide or antibody molecule) and which are formed with inorganic acids such 
as, for example, hydrochl ric or phosph ric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts formed from the free carboxyl groups can 
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als be derived from inorganic bases such as, for example, sodium, potassium, 
ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamin , 
trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like. 

The therapeutic compositions are conventionally administered intravenously, as by-^ 
injection of a unit dose, for example. The term "unit dose" when used in reference to a 
therapeutic composition of the present invention refers to physically discrete units 
suitable as unitary dosage for humans, each unit containing a predetermined quantity of 
active material calculated to produce the desired therapeutic effect in association with 
the required diluent; i.e., carrier, or vehicle. 

The compositions are administered in a manner compatible with the dosage formulation, 
and in a therapeutically effective amount. The quantity to be administered depends on 
the subject to be treated, capacity of the subject's immune system to utilize the active 
ingredient, and degree of inhibition or neutralization of HCV desired. Precise amounts of 
active ingredient required to be administered depend on the judgment of the practitioner 
and are peculiar to each individual. However, suitable dosages may range from about 
0.1 to 20, preferably about 0.5 to about 10, and more preferably one to several, 
milligrams of active ingredient per kilogram body weight of individual per day and 
depend on the route of administration. In the case of attenuated virus used as a 
vaccine, dosages could range from 10 to 10* infectious doses. For inactivated viral 
vaccines, higher doses of HCV antigen and a suitable adjuvant could be required. 
Suitable regimes for initial administration and booster shots are also variable, but are 
typified by an initial administration followed by repeated doses at one or more hour 
intervals by a subsequent injection or other administration. Alternatively, continuous 
intravenous infusion sufficient to maintain concentrations of ten nanomolar to ten 
micro molar in the blood are contemplated. 

The therapeutic compositions may further include an effective amount of the 3' 
sequence element, its antagonist, or analog thereof, in combination with an antibiotic, a 
steroid, interferon or other anti-HCV therapeutic. Exemplary formulations are given 
below: 
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Intravenous Formulation I 
Ingredient 
5. cefotaxime 

HCV, fragment or antagonist 
dextrose USP 
sodium bisulfite USP 
edetate disodium USP 
10 water for injection q.s.a.d. 

Intravenous Formulation II 
Ingredient 
ampicUlin 
1 5 HCV, fragment or antagonist 
sodium bisulfite USP 
disodium edetate USP 
water for injection q.s.a.d. 

20 Intravenous Formulation III 
Ingredient 

gentamidn (charged as sulfate) 
HCV, fragment or antagonist 
sodium bisulfite USP 
25 disodium edetate USP 

water for injection q.s.a.d. 

Intravenous Formulation IV 
Ingredient 
30 HCV, fragment or antagonist 
dextrose USP 
sodium bisulfite USP 

~edetate~disodium-USP — 

water for injection q.s.a.d. 
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Formulations 



mg/ml 

250.0 

10.0 

45.0 

3.2 

0.1 

1.0 ml 



mg/ml 

250.0 

10.0 

3.2 

0.1 

1.0 ml 



ma/mi 

40.0 
10.0 
3.2 
0.1 

1.0 ml 



mg/ml 

10.0 

45.0 

3.2 

0.1 

TOmT 



35 
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Intravenous Formulation V 



Ingredient 



mo /ml 



HCV, fragment or antagonist 



5.0 



sodium bisulfite USP 



3.2 



5- 



disodium edetate USP 



0.1 



water for injection q.s.a.d. 



1.0 ml 



As used herein, "pg" means picogram, "ng" means nanogram, "ug" or "//g" mean 
microgram, "mg" means milligram, "ul" or m jA m mean microliter, "ml" means milliliter, "I" 
10 means liter. 

A wide variety of cell types may be useful as host cells for HCV replication, as initiated 
using the functional HCV cDNA clones created by incorporation of sequences from the 
present invention. These host celts may include primary human cells (e.g., hepatocytes, 

15 T-cells, B-cells, monocytes/macrophages, foreskin fibroblasts) as well as continuous 
human cell lines [e.g., HepG2, Huh7 # HUT78, HPB-Ma, MT-2 <and other HTLV-I and 
HTLV-II infected T-cell lines), Namaiowa, Daudi, EBV-transformed LCLs]. In addition, 
continuous cell lines which are readily transfected with RNA and permissive for 
replication of flaviviruses or pestiviruses may support HCV replication (e.g. SW-1 3, 

20 Vero, BHK-21 , COS, PK-1 5, MBCK). One skilled in the art will be able to select the 
proper host cells without undue experimentation to accomplish the desired infectivrty 
assay without departing for the scope of this invention. 

It is further intended that 3* terminal sequence analogs or HCV analogs may be prepared 
25 from nucleotide sequences derived within the scope of the present invention. Analogs, 
such as fragments or mutants {e.g., "muteins,") can be produced by standard deavage 
by restriction enzymes, or site-directed mutagenesis of the HCV coding and non-coding 
(5' and 3' terminal) sequences. Analogs exhibiting "HCV inhibiting activity" such as 
small molecules, whether functioning as promoters or inhibitors, may be identified by 
30 known in vivo and/or in vitro assays. f 

As mentioned above, a ON A sequence encoding the 3* sequence element, its 
complement, or the full length wild-type or attenuated HCV can be prepared 

synthetically rather than cloned. The DNA~sequencecan-be designed-with-the- 

35 appropriate codons for amino acid sequence encoded by the HCV open reading frame. 
The complete sequence is assembled from overlapping oligonucleotides prepared by 
standard methods and assembled into a complete coding sequ nee. See, e.g.. Edge, 
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Nature, 292:756 (1981); Nambair et al. Science, 223:1299 (1984); Jay et al. J. Biol. 
Chem., 255:6311 (1984). 

The ability of the 3' terminal sequence element to drive replication-competent HCV 
5 RNAs can be analyzed by using constructs in which the ORF of HCV has been replaced 
by a reporter gene, such as luciferase, which can be detected directly and correlated to 
the level of HCV RNA in the cell. In particular, the 3' terminal sequence element can he 
used to derive replication competent HCV RNAs, either fulMength RNAs capable of 
complete replication and virus production or repficons. Such replicons would be capable 

10 of RNA replication, but might lack the structural region/packaging machinery and hence 
not produce virus. Cells transfected/transformed with such replicons (containing the 3' 
element) would be useful for inhibitors of HCV RNA replication, including those which 
might interfere with the function of the 3' element or its complement. RNA replication 
could be assessed either by looking directly at HCV RNA levels (RT/PCR, B-DNA, 

15 Northern blot analyses) or by incorporating a sensitive reporter (like luciferase) under the 
control of the HCV RNA replication and translational machinery. 

Synthetic DNA sequences allow convenient construction of genes which will express 
HCV, HCV variants or attenuated HCV. Alternatively, DNA encoding variant or 
20 attenuated HCV can be made by site-directed mutagenesis of native HCV cONAs. 

A general method for site-specific incorporation of unnatural amino acids into proteins is 
described in Christopher J. Noren, Spencer J. Anthony-Cahill, Michael C. Griffith, Peter 
G. Schultz, Science, 244:182-188 (April 1989). This method may be used to create 
25 HCV virions containing proteins with unnatural amino acids. 

The present invention extends to the preparation of amis ens e nucleotides and ribozymes 
that may be used to interfere with HCV RNA translation, stability, 
replication/transcription and/or packaging. This approach utilizes antisense nudeic acid 
30 and ribozymes to block viral replication, either by masking the HCV RNA with an 
antisense nucleic acid or cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a 

portion of a specific RNA moleculeT(See~We1ntfaubr1 990; Marcus-Sekurar-1988T)-ln 

35 the cell, they hybridize to that RNA, forming a double stranded molecule. Therefore, 
antisense nucleic acids may interfere with viral replication (either by a direct blocking 
ff ct or by leading to degradation f the target RNA by cellular enzymes). Oligom rs 
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of about fifteen nude tides ar appr priate, since they ar asy to synthesize and are 
likely to po_s few r problems than larger mol cules when introducing them into HCV- 
infectible cells. However, also suitable are oligonucleotides of natural structure or those 
with modifications to enhance stability, facilitate, uptake, etc. In addition, longer 
EL antisense RNAs can be generated in vivo In hepatocytes or other HCV target cefls*13sing 
gene therapy approaches. Antisense methods have been used to inhibit the expression 
of many genes in vitro (Marcus-Sekura, 1988; Hambor et al, 1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave other single 
1 0 stranded RNA molecules in a manner somewhat analogous to DNA restriction 

endonucieases. Ribozymes were discovered from the observation that certain mRNAs 
have the ability to excise their own introns. By modifying the nucleotide sequence of 
these RNAs, researchers have been able to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech, 1988.). Because they 
1 5 are sequence-specific, only RNAs <such as the HCV positive-sense genome RNA or its 
complement) with particular sequences are inactivated. 

investigators have identified two types of ribozymes, Tetrahymena-type and 
"hammerhead'-type. (Hasselhoff and Gerlach, 1988) Tetrahymena-type ribozymes 

20 recognize four-base sequences, while "hammerhead "-type recognize eleven- to 

eighteen-base sequences. The longer the recognition sequence, the more likely it is to 
occur exclusively in the target mRNA species. Therefore, hammerhead-type ribozymes 
are preferable to Tetrah ymena-vypQ ribozymes for inactivating a specific mRNA species, 
and eighteen base recognition sequences are preferable to shorter recognition 

25 sequences. 

The DNA sequences described herein may thus be used to prepare antisense molecules 
against, and ribozymes that cleave HCV RNA. It should be appreciated that such 
antisense molecules and ribozymes will encompass nucleotide sequences of both 
30 positive and negative strand polarity, such that they may bind to both the positive and 
negative strand HCV RNAs. 

The present invention also relates to a variety of diagnostic applications, including 

methods for detecting HCV. The ihventidTrals^rrela^te^to^methods^correlating^ 

35 NTR sequences with various clinical parameters such as disease severity, response to 
treatment with interferon, and immune status, or to determine tissue tropism (predictive 
diagnostics.) 



WO 97/08310 



PCT/US96/14033 



32 

The pr sent HCV sequences can be used experimentally t identify HCV isolates 
containing additional 3' sequences, to determine 3' NTR sequences f r various HCV 
genotypes to define further areas of conservation and divergence. Also, the chemical 
modification and analysis of the RNA by RNAse mapping and three-dimensional 
5. structure analysis of the RNA is an aid to identifying host or viral factors which fnTSract 
with the sequence and/or identifying molecules which inhibit replication. 

HCV RNA can also be used therapeutically, i.e., attenuated HCV can be used for 
vaccine development, and the 3' NTR sequence element may be used as a trans- 
10 dominant inhibitor of HCV replication via gene therapy. 

Replication of HCV in cells can be ascertained by branched ONA (B-DNA), quantitative 
RT/PCR and immunological procedures, or using standard methods for determining virus 
titer (i.e., titration in the chimpanzee). The procedures and their application are all 
1 5 familiar to those skilled in the art and accordingly may be utilized within the scope of 
the present invention. A "competitive" antibody binding procedure is described in U.S. 
Patent Nos. 3,654,090 and 3,850,752. A "sandwich" procedure, is described in U.S. 
Patent Nos. RE 31,006 and 4,016,043. Still other procedures are known such as the 
"double antibody", or "DASP" procedure. 

20 

in each instance, HCV proteins form complexes with one or more antibody(ies) or 
binding partners and one member of the complex is labeled with a detectable label. 
Alternatively, an antibody may be raised, or identified in HCV-infected patients, which 
binds to the present HCV 3' terminal sequence element. The fact that a complex has 
25 formed and, if desired, the amount thereof, can be determined by known methods 
applicable to the detection of labels. 

Also within the scope of the invention are RNA molecules which mimic the 3' terminal 
sequence element or its complement or bind to these elements, selected in vrtro using 

30 the "SELEX" (Tuerk and Gold, 19_J or other in vrtro selection/evolution approaches. 
These methods provide libraries of RNAs with randomized sequences which can be 
selected by reiterative binding to a target (in this case, the 3' terminal sequence 

element, its complement, or the cognate binding partners required for the functions of 

these respective elements), and RNAs bound thereto are amplified by"PCR^Such 

35 molecules may either mimic the structure of the 3' terminal element or be competitive 
inhibitors of the 3' terminal sequence element. Such SELEX RNAs may be suitable for 
diagnostic and ven therap utic uses within the scop of the present invention. 
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Alternatively, the presence of HCV RNA can b determined by Northern analysis, 
polymerase chain reaction (PCRI, primer xtensi n, and th like. 

The labels most commonly employed for these studies are radioactive elements, 
5 enzymes, chemicals which fluoresce when exposed to ultraviolet light, and othersT" 

A number of fluorescent materials are known and can be utilized as labels. These 
include, for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and 
Lucifer Yellow. A particular detecting material is anti-rabbit antibody prepared in goats 
10 and conjugated with fluorescein through an isothiocyanate. 

An antibody to HCV proteins, or a probe for HCV RNA or their binding partner(s) can 
also be labeled with a radioactive element or with an enzyme. The radioactive label can 
be detected by any of the currently available counting procedures. The preferred 
15 isotope may be selected from 3 H, 14 C, M P, "S, 36 CI, 6, Cr. "Co, "Co, 69 Fe, *>Y, m l, ,J1 I, 
and 1M Re. 

Enzyme labels are likewise useful, and can be detected by any of the presently utilized 
colorimetric, spectrophotometric, fluorospectrophotometric, amperometric or gasometrtc 

20 techniques. The enzyme is conjugated to the selected probe by reaction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde and the like. Many 
enzymes which can be used in these procedures are known and can be utilized. Those 
preferred are peroxidase, ^-glucuronidase, O-D-glucosidase, &-D-galactosidase, urease, 
glucose oxidase plus peroxidase and alkaline phosphatase. U.S. Patent Nos. 

25 3,654,090; 3,850,752; and 4,016,043 are referred to by way of example for their 
disclosure of alternate labeling material and methods. In addition, a probe may be 
biotin-iabelled, and thereafter be detected with labelled avidin, or a combination of 
avidin and a labelled anti-avidin antibody. Probes may also have digoxygenin 
incorporated therein and be then detected with a labelled anti-digoxygenin and detected 

30 with a labelled anti-digoxygenin antibody. 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specia list may be prepared to determine the presence or absence of infectious 

HCV in suspected patient samples. Such test kits may employ techniques such as 

35 RT/PCR, branched DNA and ligation chain reaction (LCR). In accordance with the 

testing techniques discussed abov , one class of such kits will contain at I ast a labeled 

HCV antibody r oligonucleotide probe r its binding partner, and directions, of course, 
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depending upon the method selected. The kits may also contain peripheral reagents 
such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence of HCV, 
_5 comprising: - ~~ 

(a) a predetermined amount of at least one labeled oligonucleotide probe directed to 
a 3' sequence element of the HCV genome, obtained by the direct or indirect 
attachment of the oligonucleotide or a specific binding partner thereto, to a detectable 
label; 

10 (b) other reagents; and 

(c) directions for use of said kit. 

More specifically, the diagnostic test kit may comprise: 

(a) a known amount of the labeled oligonucleotide probe as described above (or a 
1 5 binding partner) generally bound to a solid phase to form an immunosorbent, or in the 

alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

(b) if necessary, other reagents; and 

(c) directions for use of said test kit. 

20 

In accordance with the above, an assay system for screening potential drugs effective 
to modulate the replicative activity of the HCV RNA may be prepared. The infectious 
HCV RNA may be introduced into a test system, and the prospective drug may also be 
introduced into the resulting cell culture, and the culture thereafter examined to observe 
25 any changes in the replicative activity of the cells, due to the addition of the prospective 
drug. 

PRELIMINARY CONSIDERATIONS 

30 While a great deal of progress has been made in the last several years, there are still a 
vast number of unanswered questions concerning HCV replication, pathogenesis and 
immunity. The field is rapidly reaching a bottleneck some aspects of the functions of 
the HCV genome RNA and its encoded proteins are understood, but prior to the present 
invention, no way existeci~of experimentally testing structure/function questions in the 

35 context of authentic virus replication. Such analyses are critical for understanding each 
step in the virus fife cycle at a level which will allows the design of protective vaccines 
and effective therapy for chr nically infected patients. 
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The present invention stemmed from attempts to construct functional cONA clones f r 
HCV, in particular the HCV-H strain. Prior to the present invention, attempts to rec ver 
infectious HCV RNA from cDNA have been unsuccessful. Several possible explanations, 
alone or in combination, account for previous failures, including missing or incorrect 

5 . terminal sequences, internal errors deleterious or lethal for HCV replication, or ~ 

inadequate methods for assaying infectivrty and replication. 

Rationale for rigorously determining the HCV-H termini. 5' and 3' terminal sequences of 
HCV-H were previously unknown. Previous attempts to generate functional transcripts 

10 used terminal sequences determined for other HCV isolates. As mentioned above, work 
in other RNA virus systems has shown that specific terminal sequences can be critical 
for the generation of functional, replication competent RNAs (Boyer and Haenni, (1994) 
J. Gen. ViroL 198:415-426). Such sequences are believed to be involved in initiation of 
negative- and positive-strand RNA synthesis. Given the importance of these sequence 

1 5 elements for other viruses, the present invention more rigorously determined the HCV-H 
terminal sequences. 

Structure of the 5' NTR. Methods used to amplify and done the extreme 5' termini of 
RNAs include homopolymer tailing or ligation of synthetic oligonucleotides to first-strand 

20 cDNA (5' RACE), cyclization of first-strand cONA followed by inverse PCR (Zeiner and 
Gehring (1994) BioTechniques 17:1051-1053), or cyclization of genome RNA with RNA 
Ugase {after treatment to remove 5' cap structures, if necessary) followed by cONA 
synthesis and PCR amplification across the 5'-3' junction (Mandl et al (1991) 
Biotechniques 10:486). 5' terminal sequences were determined for a number of HCV 

25 isolates and are in general agreement. For HCV-H, both the cyclization Anverse PCR and 
5' RACE methods were used to determine a 5 '-terminal consensus sequence: 

5 '-G CCAG CCCCCTG ATGGGGGCG ACACTCCACCATGAAATC-3' (SEQ ID NO:37) 

30 This sequence is highly homologous to those determined for other isolates. At lower 
frequency, clones with additional 5' residues (usually 1 additional G) were also 
recovered. Although this might reflect additional sequences or heterogeneity at the 
HCV 5' terminus, these clones may be artif actual and created by partial copying of a 5' 
cap structure or addition of non-templated 3 '"bases"by reverse^transcriptase during 

35 first-strand cDNA synthesis. It cannot be excluded that the 5' terminus of HCV genome 
RNA contains a 5' cap structure or a covalently-linked terminal protein such as VPg of 
the picornaviruses. For the p stiviruses, recent studies sugg st that genome RNAs may 
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not contain a 5' cap {Brock et al (1992) J. Virol. Meth. 38:39-46) and that this 
structure is not required for infectivity f transcrib d RNA <R. D nis, R. J. Mo rman, 
personal communications). Consistent with these observations, neither HCV nor the 
pestiviruses contain motifs characteristic of virus-encoded enzymes involved in capping 
5- or methylation of cap structures (Rice (1995)). There is the possibility of a proteirP 
covalentiy-linked to the 5' terminus of HCV genome RNA, but synthetic RNA transcripts 
corresponding to viruses that normally contain such 5' structures are infectious, 
indicating that they are usually not an absolute requirement for initiation of replication. 

10 EXAMPLE 1 

Structure of the HCV-H 3* NTR< Determination of the extreme 3' terminal HCV 
sequences presented a greater challenge. Due to limited quantities of HCV genome 
RNA, the classic method of 3'-end labeling and direct RNA sequence analysis has not 

15 been feasible. One of the first reports suggested that HCV RNAs contained 3' terminal 
poly (A) tracts (for HCV-1) Han et al (1991) used tagged oligo (dD primers for cDNA 
synthesis followed by PCR amplification, cloning and sequence analysis. This is not an 
acceptable method for 3' end determination as it already presupposes a 3' poly (A) or 
polypurine tract and would select for such RNAs even if they were present in low 

20 abundance. Some reports utilized f. coti poly (A) polymerase to add 3' homopotymer 
tracts prior to oligo (dD-primed cDNA synthesis and found evidence of a 3' terminal 
poly (U) (i.e.. Kato et al, 1990) or isolated 3' clones with poly (U) tracts from randomly 
primed cDNA libraries. To actually determine the 3' terminal sequence and to solve this 
contradiction [3' poly (A) versus poly (U)J, 5' RACE was used to determine the 5' 

25 terminal sequence of HCV negative-strand RNA (Chen et al, 1992). This study 

predicted a 3' terminal poly (U) tract for the HCV genome RNA. Subsequently, other 
groups have not attempted to determine actual 3' termini, but rather assumed a 3' poly 
(U) tract and used oligo (dA) for priming cDNA synthesis or isolated 3' clones with poly 
(U) tracts from randomly primed cDNA libraries. Depending upon the actual 3' 

30 structure, ail of these approaches have potential problems (some are discussed below) 
and a critical reading of the literature makes it clear that the 3' end of the genome RNA 
has until now been poorly characterized and uncertain at best. Alternative approaches 
fordetermining-the-terminaLsec|uence of HCV-H were therefore pursued. 



35 One such approach, mentioned above for 5' end determination, is to cyclize the RNA 
using T4 RNA ligase followed by cDNA synthesis and amplification using a 
negative-sense primer complementary to a sequ nc in the 5' NTR and a positive-sense 
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primer near the 3' end (Mandl et al, 1991). Ideally, cloning and sequence analysis of 
such products should provide information on 5' and 3' terminal sequenc s present in the 
same RNA molecule. In the case of HCV, since the 5' terminus has been reasonably 
well-defined by other methods, these data should allow determination of the 3' 
5- terminus. Unfortunately, despite repeated attempts, this method has failed for * 
determination of HCV-H terminal sequences (even using RNA obtained from high 
specific-infectrvity plasma). Potential problems include (i) a blocked 5' terminus (ii) lack 
of a 3' -OH group or a poor acceptor for RNA ligase (such as a U residue; Moore and 
Sharp, 1 992) (iii) ribonuclease activity during RNA preparation or RNA ligation (see 

10 below) or (iv) terminal RNA structures sterically inhibiting 5'-3' ligation. In any case, 
recent work has shown this method to be unreliable, even when using large quantities 
of purified, initially intact TBE genome RNA. Likely problems include RNase 
contamination in most commercial preparations of tobacco acid pyrophosphatase and 
T4 RNA ligase (A. A. Kolykhalov, unpublished data) and probable hypersensitivity of the 

15 poly (U)/potypyrimidine tract to the action of RNases. In any case, poly (A) was 
incorrectly assigned as the terminal sequence of TBE genome RNA, whereas later 
experiments demonstrated that such poly (A) tracts were internal and followed by 
additional sequences including a 3' terminal hairpin structure (C. Mandl, personal 
communication). Interestingly , the correct 3' structure, but not the poly (A) tract, is 

20 required for infecttvity of transcribed TBE RNA (C. Mandl, personal communication). 

Two other methods were considered. For determining the YF 3' terminus, £. colt poly 
(A) polymerase had been used to add 3' terminal poly (A), followed by oligo (dT) priming 
and a selective cloning strategy. The YF 3' terminus was cloned with great difficulty 

25 and found to be a highly stable hairpin structure (Hahn et at, 1987; Rice et al, 1985). 
However, this approach was not considered since addition of 3' poly (A) would allow 
self-priming at the HCV poly (U) tract and subsequent elimination of potential sequences 
in between during second-strand cDNA synthesis and cloning. Rather, an alternative 3' 
RACE method was used in which a synthetic oligodeoxynucleotide, present at high 

30 concentrations, was ligated to the 3' end of the RNA to serve as a specific priming site 
for cDNA synthesis (Figure 1). Ligation conditions were optimized by assaying the 
ability of T4 RNA ligase to ligate 5 '-end-labeled oligonucleotides to a synthetic acceptor 
RNA (Brennan et al (198 3) Meth. Enz. 100:38-52). Critical parameters included the 
batch of RNA ligase (many were heavily contaminated witFRNa^)7the~concentration 

35 of DMSO (20-30%), and the particular oligonucleotide used for ligation (A. A. 

Kolykhalov, unpublished). For the 3' analysis of HCV-H, 10* molecules of RNA were 
purified from high-titered H77 plasma, ligated to the synthetic ligonucleotide, and this 



WO 97/08310 



PCT/US96/14033 



38 

modified RNA used f r RT/PCR (Figure 1), On primer for cDNA synthesis and PCR 
amplification (tig B - SEQ ID NOS:6 and 7) was complementary to that used for 
ligation to the RNA (oligo A - SEQ ID NO:5); a second positive-sense primer 
corresponded to a sequence near the 3' end of the HCV ORF (oligo C - SEQ ID NO:8). 

& 

A smear of amplified products, as resolved by agarose gel electrophoresis, was obtained 
after 40 cycles of PCR amplification. This DNA was either subjected to additional PCI 
analyses or cloned directly for sequence determinations. The presence of predicted 
internal HCV sequences and homopolymer tracts was assayed using a nested positive 

10 sense primer and either oligo <dA) or oligo (dT). A product of the expected size (based 
on previous HCV 3' NTR sequences) was obtained using the oligo (dA) primer; no 
product was found using oligo (dT). Prototype HCV cDNA dones terminating In either 
poly (A) or poly (T) served as positive and negative controls for these primer pairs and 
gave the expected results. These data strongly suggest that HCV-H does not contain 

15 poly (A) but rather, as found for most HCV isolates, contains a poly (U) tract [or at least 
a site for priming by oligo (dA)] at or near its 3' terminus. From the cloned material 
[which had not be subjected to further amplification using oligo <dA)], sequences from 
20 independent dones were determined. Essentially all of these dones contained (5' to 
3% positive-sense) (i) the previously determined HCV-H sequence ( I nchauspe et al 

20 (1991) Proc. Natl. Acad. ScJ. USA 88:10292-10296) (ii) 40 bases homologous to other 
HCV isolates (Figure 2) (iit) poly (U) tracts of various lengths and (iv) the sequence of 
the oligonudeotide used for RNA ligation. Five independent dones, derived from two 
different PCR amplification experiments, were found to have unusual structures. 
Following variable lengths of poly (U) and polypyrimidine stretches consisting of mainly 

25 U with occasional interspersed C residues, four of these dones contained a novel 
sequence of 101 bases (SEQ ID NO:1) which was nearly identical in all dones (two 
dones differed by 1 substitution each; 1 clone terminated after only 39 bases of this 
sequence) (Figure 3). This 101 base sequence, particularly the 3' terminal 46 bases, is 
predicted (FOLDRNA, GCG package) to form a highly stable secondary structure 

30 reminiscent of the 3' termini of members of the flavivirus genus (Chambers et af (1990) 
Virology 177:159-174) (Figure 4). However, an exhaustive search (BLAST, FASTA) of 
the databases has revealed no entry showing significant homology to this novel HCV 

__seguence. 

35 Several lines of evidence suggest that this 101 base sequence is not an RT/PCR artifact 
and represents the 3' terminal sequence f HCV genome RNA. 
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First, as mention d abov , HCV-H clones with similar but not identical structures were 
obtained from two independent experiments (Figure 3). Thes clones differed in the 
length of the poly (U)/polypyrimidine tract and by a few base substitutions within the 
101 base element, but the sequences across the breakpoint between the novel 
5- sequence and the ligated oligonucleotide were identical in four clones. ~ 

In a second set of experiments, negative-sense oligonucleotides were designed based on 
the sequence of the 101 -base element, and used for RTYPCR amplification and cloning 
of HCV RNA from either HCV-H or four different clinical samples obtained from 
investigators around the country* Samples were obtained from patients with chronic 
hepatitis C and all were HCV RNA positive and of different genotypes (1b, 3 and two 
samples of genotype 4) than previously analyzed HCV-H(1a). Multiple independent 
clones were obtained and sequenced (Figure 6) for each amplified sample. As described 
above, for HCV-H, all clones contained identical sequences at the end of the ORF and 
the 3' NTR sequence preceding the poly (U)/polypyrimidine tract. This poly 
(U)/po!ypyrimidine tract was variable in length and followed by the novel 3' element. 
Clones from the other isolates had similar structures except that genotype-specific 
differences were observed in the ORF and 3' NTR sequence preceding the poly 
(U)/polypyrimldine tract. The sequence of the novel 3' element was present and 
absolutely identical in all of these clones suggesting that this element is both present in 
the genome RNAs of distinct genotypes and highly conserved. This experiment also 
proves that this structure is not an in vitro artifact generated by T4 RNA ligase. 

This analysis demonstrated that the novel 3' element was present in other HCV 
25 genotypes, but did not define the actual 3' end of these genome RNAs. Using a similar 
RNA ligation procedure (Figure 7), the 3' terminal sequences of these different HCV 
genotypes were determined. The same novel 3' terminal sequence (with one or two 
isolate-specif ic substitutions) was found joined, at exactly the same breakpoint, to the 
sequence of the oligonucleotide used for T4 RNA ligation (Figure 8). 

30 

A fourth experiment provides yet more evidence that the 3' novel sequence represents 
the 3' terminus of HCV genome RNA. It could be argued that clones with the novel 

struct ure could be obtai ned by internal priming within the 3' NTR if, by chance, the 3' 

portion of the synthetic primer used for cDNA synthesis (and PCR"amplification)-was 

35 complementary to a sequence within the HCV 3' NTR. To address this concern, the 
analysis was repeated using serum from a different patient (WD) and a distinct 
oligonucleotide ("oligo G N ; SEQ ID NO:37) in the RNA ligation step, whose sequence 



15 
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was not homolog us t th oligonucleotid used in the initial experiments. Th 
complement of this olig nucleotide P ligo H"; SEQ ID NO:38)was used for cDNA 
synthesis and PCR amplification together with "oligo F" (SEQ ID NO:32). and the 
products were cloned and sequenced. The same novel 3' terminal sequence was found 
S joined, at exactly the same breakpoint, to the sequence of the alternative synthetic"* 
oligonucleotide. 

This novel 3' NTR structure appears to be highly conserved among HCV isolates and is 
likely to be an essential RNA element required for virus replication and successful 
10 recovery of infectious HCV RNA from cDNA. 

Based on these data, the current picture of HCV-H genome structure is diagrammed in 
Figure 10. The genome RNA probably initiates with a G residue and contains a 5' NTR 
of 341 bases. The ORF consists of 9033 bases encoding a polyprotein of 301 1 amino 

15 acid residues. Following the opal (UGA) stop codon is a sequence of 40 bases, a poly 
(U) tract, a pofypyrimidine stretch, and a highly conserved RNA element of about 100 
bases. Some positive-strand RNA viruses (poliovirus, Sindbis virus) contain 3' terminal 
poly (A) but many others terminate with conserved RNA sequences which can often be 
folded into stable secondary structures (bromo viruses, fiavi viruses). Besides TBE 

20 isolates containing internal poly (A) followed by a 3' terminal secondary structure, there 
is one other example of a virus which contains a 3' NTR similar to that proposed here 
for HCV. This virus, called GBV-B, is one of the newly cloned and sequenced GB 
hepatitis agents (Simons et al (1995) Proc. Natl. Acad. ScL USA 92:3401-5). These 
two isolates (GBV-A and GBV-B) together with HGV (an agent cloned by Genelabs) 

25 appear to be most closely related to HCV. All appear to have positive-strand genome 
RNAs of -9-10 kb and a single long ORF encoding proteins with significant homology 
to those of HCV. In the case of GBV-B, the 3' NTR consists of 27 bases, a pory (U) 
tract, and an additional 

sequence of 49 bases (Simons et al, 1995). Other than the poly (U) tract, this 
30 sequence shows no significant homology with the HCV-H 3' NTR (A. A. Kolykhalov, 
unpublished). 

Discussion 

The present invention provides sequence data which demonstrate that the-genome-RNA 

35 of HCV-H (a type 1a isolate) appears not to terminate with a homopolymer tract as 
previously thought, but rather with a novel sequence of 101 bases. Furthermore, 
results sugg st that this 3' NTR structure and the putative 3' terminal element may be 
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features common to other HCV gen types. In additi n to the potential importance of 
the 3' NTR for HCV replication and recovery of auth ntic HCV from cDNA, the apparent 
conservation of the conserved 3' element has important applications for HCV 
diagnostics and therapy. Determination of HCV RNA levels in patient plasma and 
5- tissues is important not only for diagnosis of HCV infection in the absence of antiEocV 
response but also for following the efficacy of therapeutic regimens (Bresters et al 
(1994) J. Med. Virol. 43:262-8; Cha et al 11991) J. Clin. Microbiol. 29:2528-34; 
Chazouilleres et al (1994) Gastroenterology 106:994-9; Davis et al (1994) Hepatology 
19:1337-41; Feray et al (1994) Hepatology 20:1137-43; Gordon et al (1994) Am. J. 

10 Gastroenterol. 89:1458-61; Simmonds et al (1994) J. Gen. Virol. 75:1053-1061; 
Wright et al (1994) Hepatology 20:773-9). Current methods, such as quantitative 
RT/PCR or branched DNA, rely on conserved RNA targets in the HCV genome which can 
be either genus-, type*, or subtype-specific. Detection of this novel, conserved 
sequence may be a useful alternative for diagnosis of HCV infection. In terms of 

1 5 therapy, highly conserved elements in RNA virus genomes have, in most cases, been 
shown to be essential for efficient virus replication. Such elements, via interaction with 
viral and/or host factors, function in translation of the Incoming viral RNA, as promoters 
for negative- and positive-strand RNA synthesis, and as signals for selective packaging 
of viral RNAs. A conserved 3' element in the HCV genome, which is likely to be 

20 important for one or more of these processes, presents an attractive therapeutic target. 
Identification of compounds which block interaction of this element with its cognate 
host or viral factors or gene therapy approaches using this element as an RNA decoy 
(Sullenger et al (1990) Cell 63:601-608) in transplanted hepatocytes may prove useful 
in eradicating or controlling chronic HCV infections. 

25 

The present invention demonstrates that there may be an association of the novel 
sequence element and HCV infection. The novel 3' sequence element may be used for 
(i) constructing full-length HCV-H cDNA clones capable of yielding infectious RNA and 
virus (vaccine development, evaluation of therapeutic compounds); (ii) engineering 

30 functional HCV RNA replicons for HCV replication studies and therapeutic evaluation; 
(iii) the determination of 3' NTR sequences for other HCV genotypes and a phyiogenetic 
analysis to define areas of conservation and divergence (nucleic acid based diagnostics 

— for_HCy_de tection); (iv) the improvement of methods for determining the HCV 3' NTR 

sequences and an examination of possible correlations between HCV~3* NTR features 

35 and clinical parameters (disease severity, IFN response, immune status) or tissue 

tropism (predictive diagnostics); (v) determination of the 3' NTR secondary structure 
using chemical modification and RNase mapping and determination f 3D structure by 
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NMR; (vi) structure/function studies on the 3' NTR using the infectious clone (therapy, 
vaccine dev I pment); (vii) definition of h st or viral fact rs which interact with the 
sequence (therapy); (viii) setting up screening assays to identify compounds which 
inhibit the interaction of the element with its cognate host and/or viral factors and to 
-5 test identified compounds for their effects on HCV replication(therapy); (ix) testing the 
conserved 3 # NTR sequence element as a trans-dominant inhibitor of HCV replication 
(gene therapy). 
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WHAT IS CLAIMED IS: 



, A nudeic acid molecule or degenerate variant thereof, which encodes e 3< 
1 • nucle,c f selected fro m the group cons.st.ng 

terminal sequence element, or a fragment thereot. ^ 
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the nucleottae sequent 

cmiiRF 3 (SEQ 10 N0S:20-24); 
the nucleotide sequences of FIGURE 3 («u 



1 
2 

4 (A) the nucleotide sequences of SEQ ID N0S-.1-4: 

xne - - FIGURE 6 (SEQ ID NOS:28-31); 

R IC) the nucleotide sequences of FlGUnt o .oc 

0, the nucleotide sequence of FIGURE 8 (SEQ ID NO:33-36); 

« nudeo.de sequence, that hybrid* • any of the foregoing sequences 

9 U nder standard hybridization -"""^ M substantially sim „ ar 

1 Q (R nucleotide sequences having a seconaary 

11 to the structure of Figure 4. 

, wherein the 3' terminal sequence element 

! 2 . The nucleic acid molecule of Claim 1 , wherein tne 

2 is present in HCV. 



^Q^W«IW> W 

fragment thereof, selected from the group consisting of: 
the nudeotide sequences of SEQ ID NOS 
the nudeotide sequences of FIGURE 3 (S 

— «f figure 6 (SEQ ID N0S:28-31); 
6 (Q the nudeotide sequences of FlGUnt o iscu ^ 



«♦ cmiiRF 3 (SEQ ID N0S:20-241; 
5 (B) the nudeotide sequences of FIGURE 3 l&tu ^ ^ 
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7 (0) the nudeotide sequences of FlGUnt » ^ ^ 



3 ^ ; nudeotjde sequflnces of SEQ ID N0S:1-* 
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xne nuwwwwiw 

« nudeo.de sequences that hybridize to any of the forego.ng sequences 

9 u „der standard hybridization conditions; and 

10 (F) nudeotide sequences havmg a secondary strucx 

11 homologous to the structure of Figure 4. 

_ks non t nhiA or RNA of Claims 1 , 2 or 3, 
, 4 The nudeic add molecule or recombinant DNA or una 

1 4. .nenu« nn -. rat ivelv linked to an expression control 

2 wherein said 3' terminal sequence element .s operatrvely 

3 sequence. 



: — . , th . 3 < termina l sequenc e element of alternate 

, 5 A probe capable of screening for the 3 term.n 

2 suains of HCV prepared from the nudeotide sequence of On 2. 
, 6 . The probe of Claim 5 labeled with a detectable marker. 
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1 7. A host ceil transformed with a recombinant DNA or RNA mol cule comprising a 

2 DNA or RNA s qu nc or degenerate variant ther of, which encodes a 3' terminal 

3 sequence element, or a fragment thereof, selected from the group consisting of: 

4 (A) the nucleotide sequences of SEQ ID N0S:1-4; 

5. (B) the nucleotide sequences of FIGURE 3 (SEQ ID NOS:20-24); - — 

6 (C) the nucleotide sequences of FIGURE 6 (SEQ ID NOS:28-31); 

7 (D) the nucleotide sequences of FIGURE 8 (SEQ ID NOS:33-36); 

8 (E) nucleotide sequences that hybridize to any of the foregoing sequences 

9 under standard hybridization conditions; and 

10 (F) nucleotide sequences having a secondary structure sub >tantially similar 

11 to the structure of Figure 4. 

1 8. The host cell of Claim 7 wherein the host is a human cell in tissue culture. 

1 9. The host cell of Claim 8, wherein the host cell is a liver cell. 

1 10. A method for detecting the presence or activity of the 3' terminal sequence 

2 element of Claim 1, comprising: 

3 A. contacting a biological sample from a mammal in which the 

4 presence or activity of a virus containing the 3' terminal sequence element is suspected 

5 with a binding partner of said 3' terminal sequence element under conditions that allow 

6 binding of said 3' terminal sequence element to said binding partner to occur; and 

7 B. detecting whether binding has occurred between said 3 r terminal 

8 sequence element from said sample and the binding partner; 

9 wherein the detection of binding indicates that presence or activity of said virus 
10 in said sample. 

1 11. The method of Claim 1 0, wherein the virus is HCV. 

1 12. A method for detecting the presence and activity of a virus in mammals 

2 comprising detecting the presence or activity of a 3' terminal sequence element 

3 according to the method of Claim 1 0, wherein detection of the presence or activity of 

4 the 3' terminal sequence element indicates the presence and activity of in said 
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1 13. a method for detecting binding sites for the 3' terminal sequence element of 

2 Claim 1, comprising: 

3 A. placing a labeled binding partner which specifically binds to said 

4 3' terminal sequence element in contact with a biological sample from a mammal in 

5 which binding sites for said 3' terminal sequence element are suspected; and - — 

6 B. examining said biological sample in binding studies for the 

7 presence of said labeled binding partner; 

8 wherein the presence of said labeled binding partner indicates a binding site for 

9 said 3' terminal sequence element. 

1 14. A method of testing the ability of a drug or other entity to modulate the activity 

2 of the 3' terminal sequence element of Claim 1 which comprises: 

3 A. culturing test cells which are infected with a virus or transfected with a 

4 nudeic acid containing said 3' terminal sequence element; 

5 B. adding the drug under test; and 

6 C. measuring the replication of said virus or nucleic acid in said test cells. 

1 15. The method of Claim 14, wherein the virus is HCV or the nucleic acid is an HCV 

2 RNA replicon. 

1 16. An assay system for screening drugs and other agents for ability to modulate 

2 the replication of a virus containing the 3' terminal sequence element of Claim 1 , 

3 comprising: 

4 A. culturing observable test celts inoculated with a drug or agent directed to 

5 the 3' terminal sequence element; 

6 B. harvesting a supernatant from said test cells; and 

7 C. examining said supernatant for the presence of said virus wherein an 

8 increase or a decrease in a level of said virus indicates the ability of a drug to modulate 

9 the replication of said virus. 

1 17. The assay system of Claim 16, wherein the virus is HCV. 

JL__18 : __^niassay system for screening drugs and other agents for ability to modulate 

2 the replication of a virus containing th^3'Terminal sequence-element-otClaim_l , 

3 comprising: 

4 A. culturing observable test cells inoculated with a drug or agent directed to 

5 the 3' t rminal sequ nee elem nt; 
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6 B. contacting said test cells with a labelled binding partner of r a primer 

7 which will amplify the genome f said virus; and 

8 C. determining the level of viral genome in the test cells wherein an 

9 increase or a decrease in a level of said genome indicates the ability of a drug to 

10 * modulate the replication of said virus. ~ ~* 

1 19. The assay system of Claim 18, wherein the virus is HCV. 

1 20. A test kit for the demonstration of a virus in a eukaryotic cellular sample, 

2 comprising: 

3 A. a predetermined amount of a detectably labelled specific binding 

4 partner of the 3' terminal sequence element of Claim 1 ; 

5 B. other reagents; and 

6 C. directions for use of said kit. 

1 21. The test kit of Claim 20, wherein the virus is HCV. 

1 22. A method of preventing and/or treating cellular debilitations, derangements 

2 and/or dysfunctions and/or other disease states in mammals caused by a virus 

' 3 containing the 3' terminal sequence element of Claim 1 , comprising administering to a 

4 mammal a therapeutically effective amount of a material selected from the group 

5 consisting of the 3' terminal sequence element, its complement or a fragment thereof, 

6 an agent capable of promoting the production and/or activity of said 3' terminal 

7 sequence element, an agent capable of mimicking the activity of said 3' terminal 

8 sequence element, an agent capable of inhibiting the activity of said 3' terminal 

9 sequence element, and mixtures thereof, or a specific binding partner thereto. 

1 23. The method of Claim 22, wherein the virus is HCV. 

1 24. The method of Claim 22 wherein said disease states include hepatitis, cirrhosis, 

2 and hepatocellular carcinoma. 

— 1 — 25= Thej-nethod of Claim 22 wherein said 3' terminal sequence element, its 

2 complement or a fragment thereof, an agent capabreof-promoting-theproductio^^ 

3 activity of said 3' terminal sequence element, an agent capable of mimicking the activity 

4 of said 3' terminal sequence element, an agent capable of inhibiting the activity of said 

5 3' terminal sequence element, and mixtures thereof, r a specific binding partner ther to 
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o nf theraov wherein a primary therapeutic agent 
is administered to modulate th course of therapy 

is also being administered. 

26 The method of Cairn 22 wherein said 3' terminal sequence element, its 
26. The method ^ ^ pfoduct(0(V|ind/or 

component or a fragment thereof, an gent P ^ ^ 

• ^* « a iH *v terminal sequence element, an agem ua H «» 

7 as the primary therapeutic agent. 

T „ -^nd of Claim 22 wherein said 3' terminal sequence element, its 
27 . The method of Cla m 22 w g ^ ^ 

comp.ement or a fragment thereof, an age« cap ^ ^ 

. ac^ of said 3. terminal ^-~J^ P ^ ^ . f ^ 

4 of said 3' terminal sequence ^ZTLl or a specific binding partner thereto 

5 3' terminal sequence element, and «« ^ 

6 is administered to modulate the course of therapy where tw 
therapeutic agents are being co-administered. 

l. ^ 4 rinim 22 wherein said 3' terminal sequence element. Its 
» The method of Oalm 22 where. ^ ^ 

compiement or a fragment thereof, an ^ ^ P of m . |micldng ^ activity 
activi ty of said , terminal ^ ^^L of inhibmng *. «*» of said 
of ^d 3- terminal sequence ^ ™lao f . or a specific binding partner thereto 
3' terminal sequence element, and mixtures thereof, or 

. ZL-. » — - - — - *— — l ™ rfeo " " *- ~ 

29 A phannKeutt* 3 "' composition for the treatment ci cellular debQitatlon, 

• • If the 3' terminal sequence element of Claim 1. its complement or a 
„.„m B nt an agent capable of mimicking the activity of said 3 
"™ . „, m^res ttoeof. o. . «»*«= **»» I-™* *— ! "* 
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B. a pharmaceutically acceptable carrier. 
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1 30. An attenuated virus comprising the 3' terminal sequence element of Claim 1 in 

2 combination with a sequence which is substantially homologous to that of HCV-1, HC- 

3 J1„ HC-J, HCV-BK, HCV-H. HC-J6, HC-J8, HC-J483, HC-J491. HC-C2, HCV-JK, HCV- 

4 N, HCV-T, HCV-JT, HC-G9, HCV-K3a, NZL1, or HCV-Tr, which virus is replication- 
& impaired. " ~ 

1 31 . An antisense nucleic acid against a viral genome RNA or its complement 

2 comprising a nucleic acid sequence hybridizing to one of the nucleic acid molecules of 

3 SEQ ID NOS:1-12, 20-24, 28-31 and 33-36 or their complements. 

1 32. The antisense nucleic add of claim 31 which comprises an oligonucleotide 

2 modified to enhance stability, uptake or targeted intracellular degradation. 

1 33. The antisense nucleic acid of Claim 31 which is RNA. 

1 34. The antisense nucleic acid of Claim 31 which is DNA. 

1 35. A recombinant DNA or RNA molecule having a DNA or RNA sequence which, on 

2 transcription, produces an antisense ribonucleic acid against a viral genome RNA or its 

3 complement, said antisense ribonucleic acid comprising a nucleic acid sequence which 

4 hybridizes to one of the nucleic acid molecules of SEQ ID N0S:1-4, 20-24, 28-31 and 

5 33-36 or their complements. 

1 36. A virus-producing cell line transfected with the recombinant DNA or RNA 

2 molecule of Claim 35. 

1 37. A ribozyme that cleaves the viral mRNA of Claim 35. 

1 38. The ribozyme of Claim 37 which is a Tetrahymena-type ribozyme. 

1 39. The ribozyme of Claim 37 which is a Hammerhead-type ribozyme. 

~1 40; A-recombinantJDNA molecule having a DNA sequence which, upon transcription, 

2 produces the ribozyme of Claim 37. — — ~_ 
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FIG. 3 



H7 7 - # 1 . 2 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCAC 

H7 7- # 8 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H 7 7 - 1 1 0 GTGTCTC ATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H77-I74 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H77-I5 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTTGC 

H7 7 - # 1 . 2 TGCAGGGGTAGGTATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H77- f 8 TGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H7 7 - f 1 0 TGCAGGGGTAGGdATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 
H77-1 74 • TGCAGGGGTAGG5ATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H77- #5 TGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H77- fl.2 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTT 

H77- | 8 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTTTT 

H7 7- # 1 0 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTTTT 

H7 7- 1 7 4 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTCCCTTTTTT 

H7 7- # 5 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTjTTT 

H77-#1.2 

H77-f8 TTTTTTTTTTT 

H77-# 10 ^TTTTTTTTTTTTTTTTTTTT TTTTTTTCTTTTTTTTTTTTTTTTTCCTT 

H77-I74 TTlTTirn TTT TTTTTTTT1 TT1 TT1 TTTT 

H77-I5 TTT TTTrTTTTTTTTTTTTTTTTT TTT TTTT 

H77-#1.2 CTTTCCTTCTTTTTT . . CCTTTCTTTTCCTTC 

H77- f 8 , CTTTTCCTTCTTTTTC . . CCTTTTTCTTTCTTC 

H7 7 - 1 1 0 TTTTTTTTTTTTTTTTTTCTTTCCTTCTTTTTT . . CCTTTCTTTTCCTTC 

H77-I74 CTTTCCTTCTTTTTTTTCCTTTCTTTTCCTTC 

H77-I5 CTTTCCTTCCTTTTCC . CTTTTCTTTTC . TTC 

H77-I1 .2 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H77- # 8 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H7 7- ? 1 0 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H7 7 - f 7 4 CTTCTTT AATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H7 7- f 5 CTTCTTT AATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGT 

H7 7 - # 1 . 2 GGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCA 
-H7J^#8__G^TCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCA 

H77- f 10 GG^CCGTGAGCtsGGA-TGACTGCAGAG^GTGCTGATACTGGCCTCTCCGCA 

H7 7 - 1 7 4 GGTCCGTGAGCf iGCATGACTGCAG^GAGTGCTGATACTGGGGTGTCTGCT_. 
H77-S5 

H77-U.2 GATCATGT 

H77-#8 GATCATGT 

H77-fl0 GATCATGT 

H77-I74 GATCATGT 

H77-I5 
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FIG. 6 

1 50 
lb CTGTCTCGTG CCCGACCCCG CTGGTTCATG TTGTGCCTAC TCCTACTTTC 
3 GTGTCACGTG CCCGAACCCG CTATTTGCTG CTTTGCCTAC TUCTACTAAC 
4? GTGTCCCATG CCCGACCCCG CTATCTACTC CTGTGCCT AC TCCTACTTTC 
4a ATGTCTCATG CCCGACCCCG CTATTTACTC CTGTGCCTAC TCCTACTTAC 



51 100 
lb CGTAGGGGTA GGCATCTACC TGCTCCCCAA CCGATGAA. . CGGGGAGCTA 

3 GGTAGGGGTA GGCATCTTTC TCCTGCCAGC GCGATGAGCT GGTAGG.ATA 
4? CGTAGGGGTA GGCATCTTCC TGCTGCCTGC TCGATAGGCA .GCT TA 

4 a AGTAGGGGTA GGCATCTTCC TGCTGCCTGC TCGGTAGGCA GCT TA 



101 



150 



lb ACACTCCAGG CCAATAGG.C AT.-CCTGTT TTTTTTTTTT TTTTTTTTTT 

3 ACACT CC ATTTCTTTTT TTGTTTTTTT TTTTTTTTTT 

4? ACACTCCGAC CT . . TAGGGT CCTTC.TGTT TTTTTTTTTT TTTTTTTTTT 

4a ACACTCCGAC CT.. TAGGGT CCCCT.TGTT TTTTTTTTTT 



lb 

3 

4? 
4a 



151 200 

tp *r t^^i^t^ ^jp^j^^t ^r^r*r *r t^^^ ^n^^r ^n^^r *r^r*nr ^n^^p ^tt^ ^i^^t^p 

TT^ TTTTT^P^^^TT^ ^nji^njiiji<j*ijnjijnjt 



201 250 
lb CTTTTCTTT 

3 ...CTTTTTC TTTCCTTTCT TTTCTGACTT CTAATT..TT CCTTCTT . . A 

4 ? TTTTTTCCTT ACCCTTTCCT TCTTTTCTTC CTTTTTTTTC CTTACTTT . . 
4a C TTTCCTTCTT T...CCTTTC CTAAT.CTTT CTTTCTT. . . 



251 

"115 GGTGGC-TGGA— TCT-TAGC£CX_AGTCACGGCT A 

3 GGTGGCTCCA TCTTAGC^C^ r AGTCACGGCT~A~ 
4? GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
4a GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
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